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Preface 


This book is based on my previous book: Tensor Calculus Made Simple, where the de- 
velopment of tensor calculus concepts and techniques are continued at a higher level. In 
the present book, we continue the discussion of the main topics of the subject at a more 
advanced level expanding, when necessary, some topics and developing further concepts 
and techniques. The purpose of the present book is to solidify, generalize, fill the gaps and 
make more rigorous what have been presented in the previous book. 

Unlike the previous book which is largely based on a Cartesian approach, the formulation 
in the present book is largely based on assuming an underlying general coordinate system 
although some example sections are still based on a Cartesian approach for the sake of 
simplicity and clarity. The reader will be notified about the underlying system in the given 
formulation. We also provide a sample of formal proofs to familiarize the reader with the 
tensor techniques. However, due to the preset objectives and the intended size of the book, 
we do not offer comprehensive proofs and complete theoretical foundations for the provided 
materials although we generally try to justify many of the given formulations descriptively 
or by interlinking to related formulations or by similar pedagogical techniques. This may 
be seen as a more friendly method for constructing and establishing the abstract concepts 
and techniques of tensor calculus. 

The book is furnished with an index in the end of the book as well as rather detailed sets 
of exercises in the end of each chapter to provide useful revision and practice. To facilitate 
linking related concepts and sections, and hence ensure better understanding of the given 
materials, cross referencing, which is hyperlinked for the ebook users, is used extensively 
throughout the book. The book also contains a number of graphic illustrations to help 
the readers to visualize the ideas and understand the subtle concepts. 

The book can be used as a text for an introductory or an intermediate level course on 
tensor calculus. The familiarity with the materials presented in the previous book will be 
an advantage although it is not necessary for someone with a reasonable mathematical 
background. Moreover, the main materials of the previous book are absorbed within the 
structure of the present book for the sake of completeness and to make the book rather 
self-contained considering the predetermined objectives. I hope I achieved these goals. 
Taha Sochi 
London, August 2017 
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Nomenclature 


In the following list, we define the common symbols, notations and abbreviations which 
are used in the book as a quick reference for the reader. 


V nabla differential operator 

V. and V' covariant and contravariant differential operators 

Vf gradient of scalar f 

V-A divergence of tensor A 

VxA curl of tensor A 

V2, On, Vis Laplacian operator 

Vv, Ov; velocity gradient tensor 

, (subscript) partial derivative with respect to following index(es) 

; (subscript) covariant derivative with respect to following index(es) 


hat (e.g. Aj, E,) 


bar (e.g. u’, Aj) 


(2) 


an perpendicular to 

1D, 2D, 3D, nD  one-, two-, three-, n-dimensional 

6/dt absolute derivative operator with respect to t 

O; and V; partial derivative operator with respect to i‘” variable 
0.; covariant derivative operator with respect to i” variable 
[27, k] Christoffel symbol of 1% kind 

A area 

B, Bi; Finger strain tensor 

Bt, Be Cauchy strain tensor 

C curve 

CG of class n 

d, d; displacement vector 

det determinant of matrix 

dr differential of position vector 

ds length of infinitesimal element of curve 

do area of infinitesimal element of surface 

dt volume of infinitesimal element of space 

e; i? vector of orthonormal vector set (usually Cartesian basis set) 
€,, €9, €g basis vectors of spherical coordinate system 

Cpr, €r9,°** ,€g¢ unit dyads of spherical coordinate system 

Cp, Cg, Cz basis vectors of cylindrical coordinate system 

Cop, Cpd,*** @zz unit dyads of cylindrical coordinate system 

E, Fi; first displacement gradient tensor 

E;, E’ i’ covariant and contravariant basis vectors 

E i’ orthonormalized covariant basis vector 


physical representation or normalized vector 
transformed quantity 
inner or outer product operator 


Eq. /Eqs. 
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qi 
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Farle 
Rijn, Reg 
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S, Sif 
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T (superscript) 
T, 7; 

tr 
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Vv, Uj 
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Li, v 

i Xi 

L,Y, 2% 

Y, Vii 
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ie 
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bij, 09, 5 
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A, Aj; 


Equation /Equations 

determinant of covariant metric tensor 

metric tensor 

covariant, contravariant and mixed metric tensor or its components 
coefficients of covariant metric tensor 

coefficients of contravariant metric tensor 

scale factor for i*” coordinate 

if and only if 

Jacobian of transformation between two coordinate systems 
Jacobian matrix of transformation between two coordinate systems 
inverse Jacobian matrix of transformation 

length of curve 

normal vector to surface 

point 

k-permutations of n objects 


i’ coordinate of orthogonal coordinate system 


i’ unit basis vector of orthogonal coordinate system 
position vector 

Ricci curvature scalar 

Ricci curvature tensor of 1° and 24 kind 
Riemann-Christoffel curvature tensor of 1% and 2" kind 
coordinates of spherical coordinate system 

surface 

rate of strain tensor 

vorticity tensor 

time 

transposition of matrix 

traction vector 

trace of matrix 

i’ coordinate of general coordinate system 

velocity vector 

volume 

weight of relative tensor 

i’ Cartesian coordinate 

i‘ Cartesian coordinate of particle at past and present times 
coordinates of 3D space (mainly Cartesian) 

infinitesimal strain tensor 

rate of strain tensor 

Christoffel symbol of 2"¢ kind 

Kronecker delta tensor 

covariant, contravariant and mixed ordinary Kronecker delta 
generalized Kronecker delta in 2D, 3D and nD space 
second displacement gradient tensor 
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covariant relative permutation tensor in 2D, 3D and nD space 
contravariant relative permutation tensor in 2D, 3D and nD space 
covariant absolute permutation tensor in 2D, 3D and nD space 
contravariant absolute permutation tensor in 2D, 3D and nD space 
coordinates of plane polar coordinate system 

coordinates of cylindrical coordinate system 

stress tensor 

vorticity tensor 

region of space 


Chapter 1 
Preliminaries 


In this introductory chapter, we provide preliminary materials about conventions and 
notations as well as basic facts about tensors which will be needed in the subsequent parts 
of the book. The chapter is therefore divided into two sections: the first is about general 
conventions and notations used in the book, and the second is on general background 
about tensors. 


1.1 General Conventions and Notations 


In this section, we provide general notes about the main conventions and notations used in 
the present book. We usually use the term “tensor” to mean tensors of all ranks including 
scalars (rank-0) and vectors (rank-1). However, we may also use this term as opposite to 
scalar and vector, i.e. tensor of rank-n where n > 1. In almost all cases, the meaning 
should be obvious from the context. We note that in the present book all tensors of all 
ranks and types are assumed to be real quantities, i.e. they have real rather than imaginary 
or complex components. 

We use non-indexed lower case light face italic Latin letters (e.g. f and h) to label 
scalars, while we use non-indexed lower or upper case bold face non-italic Latin letters 
(e.g. a and A) to label vectors in symbolic notation. The exception to this is the basis 
vectors where indexed bold face lower or upper case non-italic symbols (e.g. e; and E’) are 
used. However, there should be no confusion or ambiguity about the meaning of any one 
of these symbols. We also use non-indexed upper case bold face non-italic Latin letters 
(e.g. A and B) to label tensors of rank > 1 in symbolic notation. Since matrices in 
this book are supposed to represent rank-2 tensors, they also follow the rules of labeling 
tensors symbolically by using non-indexed upper case bold face non-italic Latin letters. 
We note that in a few cases in the final chapter (see § 7.3) we used boldface and indexed 
light face Greek symbols to represent particular tensors, which are commonly labeled in 
the literature by these symbols, to keep with the tradition. 

Indexed light face italic Latin symbols (e.g. a; and Bi) are used in this book to denote 
tensors of rank > 0 in their explicit tensor form, i.e. index notation. Such symbols may 
also be used to denote the components of these tensors. The meaning is usually transparent 
and can be identified from the context if it is not declared explicitly. Tensor indices in this 
book are lower case Latin letters which may be taken preferably from the middle of the 
Latin alphabet (such as 7,7 and k) for the free indices and from the beginning of the Latin 
alphabet (such as a and b) for the dummy indices. We also use numbered indices, such 
as (i1,72,..-,%x), for this purpose when the number of tensor indices is variable. Numbers 
are also used as indices in some occasions (e.g. €,2) for obvious purposes such as making 
statements about particular components. 
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Partial derivative symbol with a subscript index (e.g. 0;) is used to denote partial 
differentiation with respect to the i’” variable, that: is: 


0 
= — 1 
a= (1) 


However,we should note that in this book we generalize partial derivative notation so that 
0; symbolizes partial derivative with respect to the u’ coordinate of general coordinate 
systems and not just Cartesian coordinates which are usually denoted by x; or x’. The 
type of coordinates, being Cartesian or general or otherwise, will be determined by the 
context which should be obvious in all cases. 

Similarly, we use partial derivative symbol with a twice-repeated index to denote the 
Laplacian operator, that is:!! 


Partial derivative symbol with a coordinate label subscript, rather than an index, is also 
used to denote partial differentiation with respect to that spatial variable. For instance: 


O 


is used to denote the partial derivative with respect to the radial coordinate r in spherical 
coordinate systems which are identified by the spatial variables (r,0,¢). It should be 
obvious that in notations like 0, the subscript is used as a label rather than an index and 
hence it does not follow the rules of tensor indices which will be discussed later (see § 1.2). 

Following the widely used convention, a subscript comma preceding a subscript index 
(e.g. A, ;) is used to denote partial differentiation with respect to the spatial coordinate 
which is indexed by the symbol that follows the comma. For example, f; and Aj i are 
used to represent the partial derivative of the scalar f and rank-2 tensor AJ” with respect 
to the i“ coordinate, that is: 


fi = Of AM, = OAM (4) 


We also follow the common convention of using a subscript semicolon preceding a subscript 
index (e.g. Axi.;) to symbolize the operation of covariant differentiation with respect to 
the i” coordinate (see § 5.2). The semicolon notation may also be attached to the normal 
differential operators for the same purpose. For example, V.; and 0.; symbolize covariant 
differential operators with respect to the i’” variable. 

In this regard, we should remark that more than one index may follow the comma and 
semicolon in these notations to represent multiple partial and covariant differentiation with 
respect to the indexed variables according to the stated order of the indices. For example, 
A’ ip is used to represent the mixed second order partial derivative of the tensor A’ with 
respect to the j and k“" coordinates, while Byi:km is used to represent the mixed second 


(This is the Cartesian form. For the other forms, the reader is referred to § 6. 
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order covariant derivative of the tensor Bj; with respect to the k’” and m coordinates, 
that is: 
A’ je = Or (354') Biiskm = Vim (VrBii) (5) 


We also note that in a few occasions superscripts, rather than subscripts, comma and semi- 
colon preceding superscript index (e.g. f and Ae ) are used to represent contravariant 
partial derivative and contravariant tensor derivative respectively. Superscripted differen- 
tial operators (e.g. 0’ and V*) are also used occasionally to represent these differential 
operators in their contravariant form. A matter related to tensor differentiation is that 
we follow the conventional notation £ to represent the intrinsic derivative, which is also 
known as the absolute derivative, with respect to the variable t along a given curve, as 
will be discussed in § 5.3. 

Due to the restriction that we impose of using real (as opposite to imaginary and com- 
plex) quantities exclusively in this book, all arguments of real-valued functions which are 
not defined for negative quantities, like square roots and logarithmic functions, are as- 
sumed to be non-negative by taking the absolute value, if necessary, without using the 
absolute value symbol. This is to simplify the notation and avoid potential confusion with 
the determinant notation. So, ,/g means \/|g| and In(g) means In(|gJ). 

We follow the summation convention which is widely used in the literature of tensor 
calculus and its applications. However, the summation symbol (i.e. ©) is used in a 
few cases where a summation operation is needed but the conditions of the summation 
convention do not apply or there is an ambiguity about them, e.g. when an index is 
repeated more than twice or when a summation index is not repeated visually because it is 
part of a squared symbol. In a few other cases, where a twice-repeated index that complies 
with the conditions of the summation convention does not imply summation and hence 
the summation convention do not apply, we clarified the situation by adding comments 
like “no sum on index”. We may also add a “no sum” comment in some cases where the 
conditions of the summation convention do not apply technically but the expression may 
be misleading since it contains a repetitive index, e.g. when both indices are of the same 
variance type in a general coordinate system such as g” or when one of the apparent 
indices is in fact a label for a scalar rather than a variable index such as |E;| or hj. 

All the transformation equations in the present book are continuous and real, and all the 
derivatives are continuous over their intended domain. Based on the well known continuity 
condition of differential calculus, the individual differential operators in the second (and 
higher) order partial derivatives with respect to different indices are commutative, that is: 


0,0; = 0,0; (6) 


We generally assume that this continuity condition is satisfied and hence the order of 
the partial differential operators in these mixed second order partial derivatives does not 
matter. 

We use vertical bars (i.e. |::]) to symbolize determinants and square brackets (i.e. [::]) to 
symbolize matrices. This applies when these symbols contain arrays of objects; otherwise 
they have their normal meaning according to the context, e.g. bars embracing a vector 
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such as |v| mean modulus of the vector. Also, we use indexed square brackets (such as [A]' 
and [V f],) to denote the i” component of vectors in their symbolic or vector notation. 
For tensors of higher rank, more than one index are used to denote their components, e.g. 
[A];; represents the 7j‘” component of the rank-2 tensor A. 

We finally should remark that although we generally talk about nD spaces, our main 
focus is the low dimensionality spaces (mostly 2D and 3D) especially with regard to coordi- 
nate systems and hence some of the statements may apply only to these low dimensionality 
spaces although the statements are given in the context of nD spaces. In most cases, such 
statements can be generalized simply by adding extra conditions or by a slight modification 
to the phrasing and terminology. 


1.2 General Background about Tensors 


A tensor is an array of mathematical objects (usually numbers or functions) which trans- 
forms according to certain rules under coordinates change. In an nD space, a tensor of 
rank-k has n* components which may be specified with reference to a given coordinate 
system. Accordingly, a scalar, such as temperature, is a rank-0 tensor with (assuming a 
3D space) 3° = 1 component, a vector, such as force, is a rank-1 tensor with 3! = 3 com- 
ponents, and stress is a rank-2 tensor with 3? = 9 components. In Figure 1 we graphically 
illustrate the structure of a rank-3 tensor in a 3D space. 

The n* components of a rank-k tensor in an nD space are identified by k distinct inte- 
ger indices (e.g. i,j,k) which are attached, according to the commonly-employed tensor 
notation, as superscripts or subscripts or a mix of these to the right side of the symbol 
utilized to label the tensor, e.g. Ajj,, A“7* and As, Each tensor index takes all the values 
over a predefined range of dimensions such as 1 to n in the above example of an nD space. 
In general, all tensor indices have the same range, i.e. they are uniformly dimensioned.!! 
When the range of tensor indices is not stated explicitly, it is usually assumed to range 
over the values 1, 2,3. However, the range must be stated explicitly or implicitly to avoid 
ambiguity. 

The characteristic property of tensors is that they satisfy the principle of invariance 
under certain coordinate transformations. Therefore, formulating the fundamental laws 
of physics in a tensor form ensures that they are form-invariant, and hence they are 
objectively representing the physical reality and do not depend on the observer and his 
coordinate system. Having the same form in different coordinate systems may also be 
labeled as being covariant although this term is usually used for a different meaning in 
tensor calculus, as will be explained in § 3.1.1. 

While tensors of rank-0 are generally represented in a common form of light face non- 
indexed italic symbols like f and h, tensors of rank > 1 are represented in several forms 
and notations, the main ones are the index-free notation, which may also be called the 
direct or symbolic or Gibbs notation, and the indicial notation which is also called the 


[2] This applies to the common cases of tensor applications, but there are instances (e.g. in differential 
geometry of curves and surfaces) of tensors which are not uniformly dimensioned because the tensor is 
related to two spaces with different dimensions such as a 2D surface embedded in a 3D space. 
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Figure 1: Graphical illustration of a rank-3 tensor Ajj, in a 3D space, i.e. each one of 
i,j,k ranges over 1, 2,3. 


index or component or tensor notation. The first is a geometrically oriented notation with 
no reference to a particular coordinate system and hence it is intrinsically invariant to the 
choice of coordinate systems, whereas the second takes an algebraic form based on compo- 
nents identified by indices and hence the notation is suggestive of an underlying coordinate 
system, although being a tensor makes it form-invariant under certain coordinate trans- 
formations and therefore it possesses certain invariant properties. The index-free notation 
is usually identified by using bold face non-italic symbols, like a and B, while the indicial 
notation is identified by using light face indexed italic symbols such as a’ and B;;. It is 
noteworthy that although rank-0 and rank-1 tensors are, respectively, scalars and vectors, 
not all scalars and vectors (in their generic sense) are tensors of these ranks. Similarly, 
rank-2 tensors are normally represented by square matrices but not all square matrices 
represent rank-2 tensors. 
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Tensors can be combined through common algebraic operations such as addition and 
multiplication. Tensor term is a product of tensors including scalars and vectors and may 
consist of a single tensor which can be regarded as a multiple of unity. Tensor expression 
is an algebraic sum of tensor terms which may be a trivial sum in the case of a single 
term. Tensor equality is an equality of two tensor terms and/or expressions. An index 
that occurs once in a tensor term is a free index while an index that occurs twice in a 
tensor term is a dummy or bound index. 

The order of a tensor is identified by the number of its indices. For example, A‘, is 
a tensor of order 3 while By, is a tensor of order 2. The order of the tensor normally 
identifies its rank as well and hence A‘, is of rank-3 and By, is of rank-2. However, when 
the operation of contraction of indices (see § 3.2.4) takes place once or more, the order of 
the tensor is not affected but its rank is reduced by two for each contraction operation. 
Hence, the order of a tensor is equal to the number of all of its indices including the dummy 
indices, while the rank is equal to the number of its free indices only. Accordingly, A?.,,,, 
is of order 6 and rank-2 while B%, is of order 3 and rank-1. We note that many authors 
follow different conventions such as using “order” as equivalent to what we call “rank”. 

Tensors whose all indices are subscripts, like A;;, are called covariant, while tensors whose 
all indices are superscripts, like A’, are called contravariant. Tensors with both types of 
indices, like A®””, are called mixed type. Subscript indices, rather than subscripted ten- 
sors, are also described as covariant and superscript indices are described as contravariant. 
The zero tensor is a tensor whose all components are zero. The unit tensor or unity tensor, 
which is usually defined for rank-2 tensors, is a tensor whose all elements are zero except 
those with identical values of all indices (e.g. Aj; or B*?) which are assigned the value 1. 

There are general rules that govern the manipulation of tensors and hence they should be 
observed in the handling of mathematical expressions and calculations of tensor calculus. 
One of these rules is that no tensor index is allowed to occur more than twice in a legitimate 
tensor term. However, we follow in this assertion the common literature of tensor calculus 
which represents the ordinary use of repeated indices in tensor terms. In fact, there are 
many instances in the literature of tensor calculus where indices are legitimately repeated 
more than twice in a single term. The bottom line is that as long as the tensor expression 
makes sense and the intention is clear, such repetitions should be allowed with no need to 
take special precautions like using parentheses as done by some authors. In particular, the 
forthcoming summation convention will not apply automatically in such cases although 
summation on such indices, if needed, can be carried out explicitly, by using the summation 
symbol ™, or by a special declaration of such intention similar to the summation convention 
which is usually restricted to the twice-repeated indices. 

Regarding the aforementioned summation convention, according to this convention which 
is widely used in the literature of tensor calculus including the present book, dummy 
indices imply summation over their range. More clearly, a twice-repeated variable (i.e. 
not numeric) index in a single term implies a sum of terms equal in number to the range 
of the repeated index. Hence, in a 3D space we have: 


Av = Ay + Aj! + A? (7) 
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while in a 4D space we have: 
By + Cig D® = Bp Cy D' + CeD? + CD? + Cu D* (8) 


We note that although the twice-repeated index should be in the same term for the sum- 
mation convention to apply, it does not matter if the two indices occur in one tensor or in 
two tensors, as seen in the last example where the summation convention applies to a. 

We should also remark that there are many cases in the mathematics of tensors where 
a repeated index is needed but with no intention of summation, such as using A;; or At 
to mean the components of the tensor A whose indices have identical value like A,,; and 
A3. So to avoid confusion, when the dummy indices in a particular case do not imply 
summation, the situation must be clarified by enclosing such indices in parentheses or by 
underscoring or by using upper case letters with declaration of these conventions, or by 
adding a clarifying comment like “no summation over repeated indices”. These precautions 
are obviously needed if the summation convention is adopted in general but it does not 
apply in some exceptional cases where repeated indices are needed in the notation with 
no implication of summation. 

Another rule of tensors is that a free index should be understood to vary over its range 
(e.g. 1,...,7) which is determined by the space dimension and hence it should be inter- 
preted as saying “for all components represented by the index”. Therefore, a free index 
represents a number of terms or expressions or equalities equal to the number of the al- 
lowed values of its range. For example, when 7 and j can vary over the range 1,...,n then 
the expression A; + B; represents n separate expressions while the equation Al = B} rep- 
resents n xX n separate equations which represent the combination of all possible n values 
of 2 with all possible n values of j, that is: 


Ay Bi; Ag Bop oe Ags -B, (9) 


Al= Bi, A?=B}, Al= Bi. A= Be (10) 


Also, each tensor index should conform to one of the forthcoming variance transforma- 
tion rules as given by Eqs. 71 and 72, i.e. it is either covariant or contravariant. For 
orthonormal Cartesian coordinate systems, the two variance types (i.e. covariant and con- 
travariant) do not differ because the metric tensor is given by the Kronecker delta (refer 
to § 4.1 and 4.5) and hence any index can be upper or lower although it is common to use 
lower indices in this case. We note that orthonormal vectors mean a set of vectors which 
are mutually orthogonal and each one is of unit length, while orthonormal coordinate 
system means a coordinate system whose basis vector set is orthonormal at all points of 
the space where the system is defined (see § 2.6). The orthonormality of vectors may be 
expressed mathematically by: 


where the indexed 6 is the Kronecker delta symbol and the indexed V symbolizes a vector 
in the set. 
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In this context, we should remark that for tensor invariance, a pair of dummy indices 
involved in summation should in general be complementary in their variance type, i.e. 
one covariant and the other contravariant. However, for orthonormal Cartesian systems 
the two variance types are the same and hence when both dummy indices are covariant 
or both are contravariant it should be understood as an indication that the underlying 
coordinate system is orthonormal Cartesian if the possibility of an error is excluded. 

As indicated earlier, tensor order is equal to the number of its indices while tensor rank is 
equal to the number of its free indices. Hence, scalars (terms, expressions and equalities) 
have no free index since they are of rank-0, and vectors have a single free index while 
rank-2 tensors have exactly two free indices. Similarly, rank-n tensors have exactly n 
free indices. The dimension of a tensor is determined by the range taken by its indices 
which represents the number of dimensions of the underlying space. For example, in a 3D 
space the tensor Al is of rank-2 because it possesses exactly two free indices but it is of 
dimension three since each one of its free indices range over the values 1, 2,3 and hence it 
may be represented by a 3 x 3 matrix. However, in a 4D space this rank-2 tensor will be 
represented by a 4 x 4 matrix since its two free indices range over 1, 2, 3, 4. 

The rank of all terms in legitimate tensor expressions and equalities must be the same 
and hence: 

Aj — B} and A= (12) 
are legitimate but: 

Ai — B; and Ai =Ci (13) 
are illegitimate. Moreover, each term in valid tensor expressions and equalities must have 
the same set of free indices (e.g. 7, 7,k) and hence: 


Aj* — BiF and AO (14) 
are legitimate but: 
Ai* — Bim and ALO), (15) 


are illegitimate although they are all of the same rank. 

Also, a free index should keep its variance type in every term in valid tensor expressions 
and equations, i.e. it must be covariant in all terms or contravariant in all terms, and 
hence: 

Amt _ pms and Aa (16) 
are legitimate but: 
Al — Bi and Al =c” (17) 


are illegitimate although they are all of the same rank and have the same set of free 
indices. We also note that the order of the tensor indices in legitimate tensor expressions 
and equalities should be the same and hence: 


Am + Chm and D), = E!, (18) 
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are legitimate but: 
A‘, + Ch and De = he (19) 


are illegitimate although they are all of the same rank, have the same set of free indices 
and have the same variance type. 
We remark that expressions and equalities like: 


An + De and Ai’ = Be = Ge (20) 


are common in the literature of tensor calculus and its applications which may suggest that 
the order of indices (or another one of the aforementioned features of the indicial structure) 
in tensor expressions and equalities is not important. However, expressions and equalities 
like these refer to the individual components of these tensors and hence they are of scalar, 
rather than tensor, nature. Hence the expression A;; + Bj; means adding the value of the 
component A;; of tensor A to the value of the component 6;; of tensor B and not adding 
the tensor A to the tensor B. Similarly, the equality Aj’ = Bt — Ci! means subtracting 
the value of the component On of tensor C from the value of the component Be of tensor 
B to obtain the value of the component Ai of tensor A. We may similarly write things 
like A; = B’ or A; = B; to mean the equality of the values of these components and not 
tensor equality. 

As we will see (refer to § 3.1.1), the indicial notation of tensors is made with reference to 
a set of basis vectors. For example, when we write A‘ iy as a tensor we mean A‘, BiB! EX, 
This justifies all the above rules about the indicial structure (rank, set of free indices, 
variance type and order of indices) of tensor terms involved in tensor expressions and 
equalities because this structure is based on a set of basis vectors in a certain order. 
Therefore, an expression like A‘,, + B’,, means A',,E,E/E* + Bi, .E,E/E* and an equation 
like A‘,, = B',, means A‘,,E,E/E* = Bi, .E,EVE*. However, as seen above it is common 
in the literature of tensor calculus that the tensor notation like A',, is used to label the 
components and hence the above rules are not respected (e.g. By; +C;; or €;; = €'”) because 
these components are scalars in nature and hence these expressions and equalities do not 
refer to the vector basis. In this context, we should remark that an additional condition 
may be imposed on the indicial structure that is all the indices of a tensor should refer to 
a single vector basis set and hence a tensor cannot be indexed in reference to two different 
basis sets simultaneously.!] 

While free indices should be named uniformly in all terms of tensor expressions and 
equalities, dummy indices can be named in each term independently and hence: 


At, + Be, + CE, and Di = Ef + FR (21) 


[3] A tensor of different dimensionality (as in differential geometry) may not be compliant with this condi- 
tion and hence we may add an extra condition that all the indices should refer to a single vector basis 
set for any particular space. Alternatively, a tensor like this can be seen as a tensor in reference to each 
individual type of indices separately but it is not a tensor simultaneously to both types of indices where 
the simultaneously condition takes care of this. 
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are legitimate. The reason is that a dummy index represents a sum in its own term with 
no reach or presence into other terms. Despite the above restriction on the free indices, a 
free index in an expression or equality can be renamed uniformly and thoroughly using a 
different symbol, as long as this symbol is not already in use, assuming that both symbols 
vary over the same range, i.e. they have the same dimension. For example, we can change: 


Aj* + B} to AB (22) 
and change: 


as long as 7 and p have the same range and p is not already in use as an index for 
another purpose in that context. As indicated, the change should be thorough and hence 
all occurrences of the index 7 in that context, which may include other expressions and 
equalities, should be subject to that change. Regarding the dummy indices, they can be 
replaced by another symbol which is not present (as a free or dummy index) in their term 
as long as there is no confusion with a similar symbol in that context. 

Indexing is generally distributive over the terms of tensor expressions and equalities. For 
example, we have: 


[A +B], = [A], + [BI 


a 


(24) 


and . 
[A=B), <= [Al 


) 


= [B, 


(25) 
Unlike scalars and tensor components, which are essentially scalars in a generic sense, 
operators cannot in general be freely reordered in tensor terms. Therefore, we have the 


following legitimate equalities: 
fh=hf and A,B? = BIA; (26) 
but we cannot equate 0;A; to A,0O; since in general we have: 
0A; # A;O; (27) 


This should be obvious because 0;A; means that 0; is operating on A; but A;O; means 
that O; is operating on something else and A; just multiplies the result of this operation. 

As seen above, the order of the indices!4! of a given tensor is important and hence it 
should be observed and clarified, because two tensors with the same set of free indices and 
with the same indicial structure that satisfies the aforementioned rules but with different 
indicial order are not equal in general. For example, Ajj, is not equal to Aj, unless A 
is symmetric with respect to the indices i and j (refer to § 3.1.5). Similarly, B™” is not 
equal to B’”” unless B is symmetric in its indices |] and m. The confusion about the order 
of indices occurs specifically in the case of mixed type tensors such as A‘, which may not 


4] The “order” here means “arrangement” and hence it should not be confused with the order of tensor as 
defined above in the same context as tensor rank. 
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be clear since the order can be ijk or gik or jkt. Spaces are usually used in this case to 
clarify the order. For example, the latter tensor is symbolized as A’ ,, if the order of the 
indices is ijk, and as A; ‘if the order of the indices is jik while it is symbolized as A;,’ 
if the order of the indices is jk2. Dots may also be used in such cases to indicate, more 
explicitly, the order of the indices and remove any ambiguity. For example, if the indices 
i,j,k of the tensor A, which is covariant in 7 and k and contravariant in j, are of that 
order, then A may be symbolized as As , Where the dot between 7 and k indicates that 7 
is in the middle. 

We note that in many places in this book (like many other books of tensor calculus) and 
mostly for the sake of convenience in typesetting, the order of the indices of mixed type 
tensors is not clarified by spacing or by inserting dots. This commonly occurs when the 
order of the indices is irrelevant in the given context (e.g. any order satisfies the intended 
purpose) or when the order is clear. Sometimes, the order of the indices may be indicated 
implicitly by the alphabetical order of the selected indices, e.g. writing al to mean AS, 

Finally, scalars, vectors and tensors may be defined on a single point of the space or on 
a set of separate points. They may also be defined over an extended continuous region 
(or regions) of the space. In the latter case we have scalar fields, vector fields and tensor 
fields, e.g. temperature field, velocity field and stress field respectively. Hence, a “field” 
is a function of coordinates which is defined over a given region of the space. As stated 
earlier, “tensor” may be used in a general sense to include scalar and vector and hence 
“tensor field” may include all the three types. 


1.3. Exercises and Revision 


1.1 Differentiate between the symbols used to label scalars, vectors and tensors of rank 
eae 

1.2 What the comma and semicolon in A? : and A;.; mean? 

1.3 State the summation convention and explain its conditions. To what type of indices 
this convention applies? 

1.4 What is the number of components of a rank-3 tensor in a 4D space? 

1.5 A symbol like But may be used to represent tensor or its components. What is the 
difference between these two representations? Do the rules of indices apply to both 
representations or not? Justify your answer. 

1.6 What is the meaning of the following symbols: V, 0;, Ore, V7, 04; hie; Ass Or vr 
and Ci.km? 

1.7 What is the difference between symbolic notation and indicial notation? For what 
type of tensors these notations are used? What are the other names given to these 
types of notation? 

1.8 “The characteristic property of tensors is that they satisfy the principle of invariance 
under certain coordinate transformations”. Does this mean that the components of 
tensors are constant? Why this principle is very important in physical sciences? 

1.9 State and explain all the notations used to represent tensors of all ranks (rank-0, 
rank-1, rank-2, etc.). What are the advantages and disadvantages of using each one 
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1:20 


1.21 


22 


1.23 


1.24 


1.25 


1.26 
1.27 


of these notations? 
State the continuity condition that should be met if the equality: 0,0; = 0;0; is to be 
correct. 
Explain the difference between free and bound tensor indices. Also, state the rules 
that govern each one of these types of index in tensor terms, expressions and equalities. 
Explain the difference between the order and the rank of tensors and link this to the 
free and dummy indices. 
What is the difference between covariant, contravariant and mixed type tensors? Give 
an example for each. 
What is the meaning of “unit” and “zero” tensors? What is the characteristic feature 
of these tensors with regard to the value of their components? 
What is the meaning of “orthonormal vector set” and “orthonormal coordinate sys- 
tem”? State any relevant mathematical condition. 
What is the rule that governs the pair of dummy indices involved in summation 
regarding their variance type in general coordinate systems? Which type of coordinate 
system is exempt of this rule and why? 
State all the rules that govern the indicial structure of tensors involved in tensor 
expressions and equalities (rank, set of free indices, variance type and order of indices). 
How many equalities that the following equation contains assuming a 4D space: BF = 
C? Write all these equalities explicitly, ic. B} = Cl, B? = CZ, etc. 
Which of the following tensor expressions is legitimate and which is not, giving detailed 
explanation in each case? 

ABs OCR De. oR. Se 


Cc 


Which of the following tensor equalities is legitimate and which is not, giving detailed 
explanation in each case? 


A? =B., D=SS+NP, 3a+%=J%, BE=Ck, B,=3e-D, 


Explain why the indicial structure (rank, set of free indices, variance type and order of 
indices) of tensors involved in tensor expressions and equalities are important referring 
in your explanation to the vector basis set to which the tensors are referred. Also 
explain why these rules are not observed in the expressions and equalities of tensor 
components. 

Why free indices should be named uniformly in all terms of tensor expressions and 
equalities while dummy indices can be named in each term independently? 

What are the rules that should be observed when replacing the symbol of a free index 
with another symbol? What about replacing the symbols of dummy indices? 

Why in general we have: 0;A; 4 Aj;O;? What are the situations under which the 
following equality is valid: 0,;A; = A,0;? 

What is the difference between the order of a tensor and the order of its indices? 

In which case Aj;x is equal to Ajxj? What about A;;, and Ara? 

What are the rank, order and dimension of the tensor A‘, in a 3D space? What about 


the scalar f and the tensor Agr from the same perspectives? 
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1.28 What is the order of indices in A,* ,,? Insert a dot in this symbol to make the order 
more explicit. 

1.29 Why the order of indices of mixed tensors may not be clarified by using spaces or 
inserting dots? 

1.30 What is the meaning of “tensor field”? Is A’ a tensor field considering the spatial 
dependency of A‘ and the meaning of “tensor”? 


Chapter 2 
Spaces, Coordinate Systems and ‘Trans- 
formations 


The focus of this chapter is coordinate systems, their types and transformations as well as 
some general properties of spaces which are needed for the development of the concepts 
and techniques of tensor calculus in the present and forthcoming chapters. The chapter 
also includes other sections which are intimately linked to these topics. 


2.1 Spaces 


A Riemannian space is a manifold characterized by the existing of a symmetric rank-2 
tensor called the metric tensor. The components of this tensor, which can be in covariant 
form gj; or contravariant form g’’, as well as mixed form Gs are continuous variable 
functions of coordinates in general, that is: 


5: = gij(u', u’,...,u”) (28) 
Ge SGT un te fecgl”) (29) 
gj = gj CA a) (30) 


where the indexed u symbolizes general coordinates. This tensor facilitates, among other 
things, the generalization of the concept of length in general coordinate systems where the 
length of an infinitesimal element of arc, ds, is defined by: 


ds)? = g;,du'du! 31 
j 


In the special case of a Euclidean space coordinated by an orthonormal Cartesian system, 
the metric becomes the identity tensor, that is: 
Gig = Oi g =o gj = 53 (32) 
More details about the metric tensor and its significance and roles will be given in § 4.5. 
The metric of a Riemannian space may be called the Riemannian metric. Similarly, the 
geometry of the space may be described as the Riemannian geometry. All spaces dealt 
with in the present book are Riemannian with well-defined metrics. As we will see, an 
nD manifold is Euclidean iff the Riemann-Christoffel curvature tensor vanishes identically 
(see § 7.2.1); otherwise the manifold is curved to which the general Riemannian geometry 
applies. In metric spaces, the physical quantities are independent of the form of descrip- 
tion, being covariant or contravariant, as the metric tensor facilitates the transformation 
between the different forms; hence making the description objective. 


22 
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A manifold, such as a 2D surface or a 3D space, is called “flat” if it is possible to find 
a coordinate system for the manifold with a diagonal metric tensor whose all diagonal 
elements are +1; the space is called “curved” otherwise. More formally, an nD space is 
described as flat space iff it is possible to find a coordinate system for which the length 
of an infinitesimal element of arc ds is given by: 


(ds)’ = Ca(du')’ + Co(du*)” +... + Cn(du®)? = De Gi(du')” (33) 


where the indexed ¢ are +1 while the indexed wu are the coordinates of the space. For the 
space to be flat (i.e. globally not just locally), the condition given by Eq. 33 should apply 
all over the space and not just at certain points or regions. 

An example of flat space is the 3D Euclidean space which can be coordinated by an 
orthonormal Cartesian system whose metric tensor is diagonal with all the diagonal ele- 
ments being +1. This also applies to plane surfaces which are 2D flat spaces that can be 
coordinated by 2D orthonormal Cartesian systems. Another example is the 4D Minkowski 
space-time manifold associated with the mechanics of Lorentz transformations whose met- 
ric is diagonal with elements of +1 (see Eq. 241). When all the diagonal elements of 
the metric tensor of a flat space are +1, the space and the coordinate system may be de- 
scribed as homogeneous (see § 2.2.3). All 1D spaces are Euclidean and hence they cannot 
be curved intrinsically, so twisted curves are curved only when viewed externally from the 
embedding space which they reside in, e.g. the 2D space of a surface curve or the 3D space 
of a space curve. This is because any curve can be mapped isometrically to a straight line 
where both are naturally parameterized by arc length. An example of curved space is the 
2D surface of a sphere or an ellipsoid since there is no possibility of coordinating these 
spaces with valid 2D coordinate systems that satisfy the above criterion. 

A curved space may have constant curvature all over the space, or have variable curvature 
and hence the curvature is position dependent. An example of a space of constant curvature 
is the surface of a sphere of radius R whose curvature (i.e. Riemannian curvature) is 
at each point of the surface. Torus and ellipsoid are simple examples of 2D spaces with 
variable curvature. Schur theorem related to nD spaces (n > 2) of constant curvature 
states that: if the Riemann-Christoffel curvature tensor (see § 7.2.1) at each point of a 
space is a function of the coordinates only, then the curvature is constant all over the 
space. Schur theorem may also be stated as: the Riemannian curvature is constant over 
an isotropic region of an nD (n > 2) Riemannian space. 

A necessary and sufficient condition for an nD space to be intrinsically flat is that the 
Riemann-Christoffel curvature tensor of the space vanishes identically. Hence, cylinders 
are intrinsically flat, since their Riemann-Christoffel curvature tensor vanishes identically, 
although they are curved as seen extrinsically from the embedding 3D space. On the other 
hand, planes are intrinsically and extrinsically flat. In brief, a space is intrinsically flat 
iff the Riemann-Christoffel curvature tensor vanishes identically over the space, and it is 
extrinsically (as well as intrinsically) flat 2ff the curvature tensor vanishes identically over 
the whole space. This is because the Riemann-Christoffel curvature tensor characterizes 
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the space curvature from an intrinsic perspective while the curvature tensor characterizes 
the space curvature from an extrinsic perspective. 

As indicated above, the geometry of curved spaces is usually described as the Riemannian 
geometry. One approach for investigating the Riemannian geometry of a curved manifold 
is to embed the manifold in a Euclidean space of higher dimensionality and inspect the 
properties of the manifold from this perspective. This approach is largely followed, for 
example, in the differential geometry of surfaces where the geometry of curved 2D spaces 
(twisted surfaces) is investigated by immersing the surfaces in a 3D Euclidean space and 
examining their properties as viewed from this external enveloping 3D space. Such an 
external view is necessary for examining the extrinsic geometry of the space but not its 
intrinsic geometry. A similar approach may also be followed in the investigation of surface 
and space curves. 


2.2 Coordinate Systems 


In simple terms, a coordinate system is a mathematical device, essentially of geometric 
nature, used by an observer to identify the location of points and objects and describe 
events in generalized space which may include space-time. In tensor calculus, a coordinate 
system is needed to define non-scalar tensors in a specific form and identify their com- 
ponents in reference to the basis set of the system. An nD space requires a coordinate 
system with n mutually independent variable coordinates to be fully described so that any 
point in the space can be uniquely identified by the coordinate system. We note that the 
coordinates are generally real quantities although this may not apply in some cases (see § 
2.2.3). 

As we will see in § 2.4, coordinate systems of 3D spaces are characterized by having 
coordinate curves and coordinate surfaces where the coordinate curves occur at the in- 
tersection of the coordinate surfaces.! The coordinate curves represent the curves along 
which exactly one coordinate varies while the other coordinates are held constant. Con- 
versely, the coordinate surfaces represent the surfaces over which exactly one coordinate is 
held constant while the other coordinates vary. At any point P in a 3D space coordinated 
by a 3D coordinate system, we have 3 independent coordinate curves and 3 independent 
coordinate surfaces passing through P. The 3 coordinate curves uniquely identify the 
set of 3 mutually independent covariant basis vectors at P. Similarly, the 3 coordinate 
surfaces uniquely identify the set of 3 mutually independent contravariant basis vectors at 
P. Further details about this issue will follow in § 2.4. 

There are many types and categories of coordinate system; some of which will be briefly 
investigated in the following subsections. The most commonly used coordinate systems are: 
Cartesian, cylindrical and spherical. The most universal type of coordinate system is the 
general coordinate system which can include any type (rectilinear, curvilinear, orthogonal, 
etc.). A subset of the general coordinate system is the orthogonal coordinate system which 
is characterized by having mutually perpendicular coordinate curves, as well as mutually 


[5lIn fact, these concepts can be generalized to nD spaces by generalizing the concepts of curves and 
surfaces. However, the main interest here and in the forthcoming sections is 3D spaces. 
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perpendicular coordinate surfaces, at each point in the region of space over which the 
system is defined and hence its basis vectors, whether covariant or contravariant, are 
mutually perpendicular. 

The coordinates of a system can have the same physical dimension or different physical 
dimensions. An example of the first is the Cartesian coordinate system, which is usu- 
ally identified by (x,y,z), where all the coordinates have the dimension of length, while 
examples of the second include the cylindrical and spherical systems, which are usually 
identified by (p, ¢, z) and (r,@,@) respectively, where some coordinates, like p and r, have 
the dimension of length while other coordinates, like ¢@ and 6, are dimensionless. We also 
note that the physical dimensions of the components and basis vectors of the covariant 
and contravariant forms of a tensor are generally different. 

In the following subsections we outline a number of general types and categories of 
coordinate systems based on different classifying criteria. These categories are generally 
overlapping and may not be exhaustive in their domain. 


2.2.1 Rectilinear and Curvilinear Coordinate Systems 


Rectilinear coordinate systems are characterized by the property that all their coordinate 
curves are straight lines and all their coordinate surfaces are planes, while curvilinear 
coordinate systems are characterized by the property that at least some of their coordinate 
curves are not straight lines and some of their coordinate surfaces are not planes (see § 
2.4). Consequently, the basis vectors of rectilinear systems are constant while the basis 
vectors of curvilinear systems are variable in general since their direction or/and magnitude 
depend on the position in the space and hence they are coordinate dependent. 

Rectilinear coordinate systems can be rectangular (or orthogonal) when their coordinate 
curves, as well as their coordinate surfaces, are mutually orthogonal such as the well 
known rectangular Cartesian system. They can also be oblique when at least some of 
their coordinate curves and coordinate surfaces do not satisfy this condition. Figure 2 is 
a simple graphic illustration of rectangular and oblique rectilinear coordinate systems in 
a 3D space. Similarly, curvilinear coordinate systems can be orthogonal, when the vectors 
in their covariant or contravariant basis set are mutually orthogonal at each point in the 
space, and can be non-orthogonal when this condition is not met. Rectilinear coordinate 
systems may also be labeled as affine or linear coordinate systems although the terminology 
is not universal and hence these labels may be used differently. 

As stated above, curvilinear coordinate systems are characterized by the property that 
at least some of their coordinate curves are not straight lines and some of their coordinate 
surfaces are not planes (see Figures 3 and 10). This means that some (but not all) of 
the coordinate curves of curvilinear coordinate systems can be straight lines and some 
(but not all) of their coordinate surfaces can be planes. This is the case in the cylindrical 
and spherical coordinate systems as we will see next. Also, the coordinate curves of 
curvilinear coordinate systems may be regularly shaped curves such as circles and may be 
irregularly shaped and hence they are generalized twisted curves. Similarly, the coordinate 
surfaces of curvilinear systems may be regularly shaped surfaces such as spheres and may 
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(b) 


Figure 2: The two main types of rectilinear coordinate systems in 3D spaces: (a) rectan- 
gular and (b) oblique. 


be irregularly shaped. 

Prominent examples of curvilinear coordinate systems are the cylindrical and spherical 
systems of 3D spaces. All the coordinate curves and coordinate surfaces of these systems 
are regularly shaped. As we will see (refer to § 2.4), in the cylindrical coordinate systems 
the p, @, z coordinate curves are straight lines, circles and straight lines respectively, while 
the p, ¢, z coordinate surfaces are cylinders, semi-planes and planes respectively. Similarly, 
in the spherical coordinate systems the r,6,@ coordinate curves are straight lines, semi- 
circles and circles respectively, while the r,@,@ coordinate surfaces are spheres, cones and 
semi-planes respectively. We note that an admissible coordinate transformation from a 
rectilinear system defines another rectilinear system if the transformation is linear, and 
defines a curvilinear system if the transformation is nonlinear. 


2.2.2. Orthogonal Coordinate Systems 


The characteristic feature of orthogonal coordinate systems, whether rectilinear or curvi- 
linear, is that their coordinate curves, as well as their coordinate surfaces, are mutually 
perpendicular at each point in their space. Hence, the vectors of their covariant basis set 
and the vectors of their contravariant basis set are mutually orthogonal. As a result, the 
corresponding covariant and contravariant basis vectors in orthogonal coordinate systems 
have the same direction and therefore if the vectors of these basis sets are normalized they 
will be identical, i.e. the normalized covariant and the normalized contravariant basis 
vector sets are the same. 

Prominent examples of orthogonal coordinate systems are rectangular Cartesian, cylin- 
drical and spherical systems of 3D spaces (refer to Figure 4). A necessary and sufficient 
condition for a coordinate system to be orthogonal is that its metric tensor is diagonal. 
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Figure 3: General curvilinear coordinate system in a 3D space and its covariant basis 
vectors E,, E and Ez (see § 2.6) as tangents to the shown coordinate curves at a particular 
point of the space P, where u!, u? and u? represent general coordinates. 


This can be inferred from the definition of the components of the metric tensor as dot 
products of the basis vectors (see Eq. 48) since the dot product involving two different 
vectors (i.e. E;-E, or E’-E/ with i 4 7) will vanish if the basis vectors, whether covariant 
or contravariant, are mutually perpendicular. 


2.2.3. Homogeneous Coordinate Systems 


When all the diagonal elements of a diagonal metric tensor of a flat space are +1, the 
coordinate system is described as homogeneous. In this case the length of line element ds 
of Eq. 33 becomes: 

(ds)? = du'du' (34) 


An example of homogeneous coordinate systems is the orthonormal Cartesian system of a 
3D Euclidean space (Figure 4 a). A homogeneous coordinate system can be transformed 
to another homogeneous coordinate system only by linear transformations. Moreover, 
any coordinate system obtained from a homogeneous coordinate system by an orthogonal 
transformation (see § 2.3.3) is also homogeneous. As a consequence of the last statements, 
infinitely many homogeneous coordinate systems can be constructed in any flat space. 

A coordinate system of a flat space can always be homogenized by allowing the coordi- 
nates to be imaginary. This is done by redefining the coordinates as: 


Ut = J/Gui (no sum over 7) (35) 


where ¢; = +1. The new coordinates U’ are real when ¢; = 1 and imaginary when ¢; = —1. 
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wb 
(b) Cylindrical coordinate system and its orthonormal basis vectors e,, eg, ez 


(c) Spherical coordinate system and its orthonormal basis vectors e,, eg, €¢ 


Figure 4: The three prominent orthogonal coordinate systems in 3D spaces: (a) orthonor- 
mal Cartesian, (b) cylindrical and (c) spherical. 
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Consequently, the length of line element ds will be given by: 
(ds)? = dU‘dU' (36) 


which is of the same form as Eq. 34. An example of a homogeneous coordinate system 
with some real and some imaginary coordinates is the coordinate system of a Minkowski 
4D space-time of the mechanics of Lorentz transformations. We note that homogenization 
in the above sense is based on an extension of the concept of homogeneity and it is mainly 
based on the definition of the length of line element. 


2.3. Transformations 


In general terms, a transformation from an nD space to another nD space is a corre- 
lation that maps a point from the first space (original) to a point in the second space 
(transformed) where each point in the original and transformed spaces is identified by 
n independent coordinates. 'To distinguish between the two sets of coordinates in the 
two spaces, the coordinates of the points in the transformed space may be notated with 
barred symbols like (u!,u?,...,a"), while the coordinates of the points in the original 
space are notated with unbarred similar symbols like (u',u?,...,u”). Under certain con- 
ditions, which will be clarified later, such a transformation is unique and hence an inverse 
transformation from the transformed space to the original space is also defined. 

Mathematically, each one of the direct and inverse transformations can be regarded as 
a mathematical correlation expressed by a set of equations in which each coordinate in 
one space is considered as a function of the coordinates in the other space. Hence, the 
transformations between the two sets of coordinates in the two spaces can be expressed 
mathematically in a generic form by the following two sets of independent relations: 

Ht (ww tl") ul =u'(a',v,...,u”) (37) 
where 7 = 1,2,...,n with n being the space dimension. The independence of the above 
relations is guaranteed iff the Jacobian of the transformation does not vanish at any point 
in the space (refer to the following paragraphs about the Jacobian). 

An alternative to the latter view of considering the transformation as a mapping be- 
tween two different spaces is to view it as a correlation relating the same point in the same 
space but observed from two different coordinate systems which are subject to a similar 
transformation. The following will be largely based on the latter view although we usually 
adopt the one which is more convenient in the particular context. As far as the notation 
is concerned, there is no fundamental difference between the barred and unbarred systems 
and hence the notation can be interchanged. We also note that the transformations con- 
sidered here, and in the present book in general, are between two spaces or coordinate 
systems of equal dimensions and hence we do not consider transformations between spaces 
or coordinate systems of different dimensions. Consequently, the Jacobian matrix of the 
transformation is always square and its determinant (i.e. the Jacobian) is defined. As 
indicated earlier, if the mapping from an original rectangular Cartesian system is linear, 
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the coordinate system obtained from such a transformation is called affine or rectilin- 
ear. Coordinate systems which are not affine are described as curvilinear although the 
terminology may differ between the authors. 

The following n x n matrix of n? partial derivatives of the unbarred coordinates with 
respect to the barred coordinates, where n is the space dimension, is called the “Jacobian 
matrix” of the transformation between the unbarred and barred systems: 


Our Bul... But 
a. — 2 =n 
te out Ou? oun (38) 
Ou™ Ou™ | Ou” 
oul = Ou? oun 
while its determinant: 
J = det(J) (39) 


is called the “Jacobian” of the transformation where the indexed wu and w are the coordinates 
in the unbarred and barred coordinate systems of the nD space. The Jacobian array 
(whether matrix or determinant) contains all the possible n? partial derivatives made of 
the different combinations of the u and u indices where the pattern of the indices in this 
array is simple, that is the indices of u in the numerator provide the indices for the rows 
while the indices of u@ in the denominator provide the indices for the columns. This labeling 
scheme may be interchanged which is equivalent to taking the transpose of the array. As 
it is well known, the Jacobian will not change by this transposition since the determinant 
of a matrix is the same as the determinant of its transpose, i.e. det(A) = det(A7). 

We note that all coordinate transformations in the present book are continuous, single 
valued and invertible. We also note that “barred” and “unbarred” in the definition of 
Jacobian should be understood in a general sense not just as two labels since the Jacobian 
is not restricted to transformations between two systems of the same type but labeled as 
barred and unbarred. In fact the two coordinate systems can be fundamentally different in 
nature such as orthonormal Cartesian and general curvilinear. The Jacobian matrix and 
determinant represent any transformation by the above partial derivative array system 
between two coordinate systems defined by two different sets of coordinate variables not 
necessarily as barred and unbarred. The objective of defining the Jacobian as between 
unbarred and barred systems is simplicity and generality. 

The transformation from the unbarred coordinate system to the barred coordinate system 
is bijective! iff J 40 at any point in the transformed region of the space. In this case, 
the inverse transformation from the barred to the unbarred system is also defined and 
bijective and is represented by the inverse of the Jacobian matrix, that is: 


i eo i (40) 


Consequently, the Jacobian of the inverse transformation, being the determinant of the 
inverse Jacobian matrix, is the reciprocal of the Jacobian of the original transformation, 


6] Bijective means the mapping is injective (one-to-one) and surjective (onto). 
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that is: i 
J=— Al 
J (41) 


As we remarked, there is no fundamental notational difference between the barred and 
unbarred systems and hence the labeling is rather arbitrary and can be interchanged. 
Therefore, the Jacobian may be notated as unbarred over barred or the other way around. 
The essence is that the Jacobian usually represents the transformation from an original 
system to another system while its inverse represents the opposite transformation although 
even this is not generally respected in the literature of mathematics and hence “Jacobian” 
my be used to label the opposite transformation. Yes, in a specific context when one of 
these is labeled as the Jacobian, the other one should be labeled as the inverse Jacobian to 
distinguish between the two opposite Jacobians and their corresponding transformations. 
However, there may also be practical aspects for choosing which is the Jacobian and which 
is the inverse since it is easier sometimes to compute one of these than the other and hence 
we start by computing the easier as the Jacobian (whether from original to transformed 
or the other way) followed by obtaining the reciprocal as the inverse Jacobian. Anyway, 
in this book we generally use “Jacobian” flexibly where the context determines the nature 
of the transformation." 

An admissible (or permissible or allowed) coordinate transformation may be defined 
generically as a mapping represented by a sufficiently differentiable set of equations plus 
being invertible by having a non-vanishing Jacobian (J # 0). More technically, a co- 
ordinate transformation is commonly described as admissible iff the transformation is 
bijective with non-vanishing Jacobian and the transformation function is of class C?.!6l 
We note that the C” continuity condition means that the function and all its first n partial 
derivatives do exist and are continuous in their domain. Also, some authors may impose 
a weaker continuity condition of being of class C?. 

An object that does not change by admissible coordinate transformations is described 
as “invariant” such as a true scalar (see § 3.1.2) which is characterized by its sign and 
magnitude and a true vector which is characterized by its magnitude and direction in space. 
Similarly, an invariant property of an object or a manifold is a property that is independent 
of admissible coordinate transformations such as being form invariant which characterizes 
tensors or being flat which characterize spaces under certain types of transformation. It 
should be noted that an invariant object or property may be invariant with respect to 
certain types of transformation but not with respect to other types of transformation and 
hence the term may be used generically where the context is taken into consideration for 


I7lWe may get rid of all these complications by using the term “Jacobian” to represent a system of 
partial derivatives whose objective is to transform from one coordinate system to another (regardless of 
anything else like system notation or being original and secondary) and hence both the “Jacobian” and 
its inverse are Jacobians in this general sense; the two will be distinguished from each other by stating 
from which system to which system the Jacobian transforms. Hence, we can say legitimately and with 
no ambiguity: the Jacobian from A to B and the Jacobian from B to A. These may be labeled as 
J(A— B) and J(B — A) with similar notation for the Jacobian matrix. This, we believe, can resolve 
all these issues and avoid confusion. 

8] The meaning of “admissible coordinate transformation” may vary depending on the context. 
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sensible interpretation. 

A product or composition of space or coordinate transformations is a succession of trans- 
formations where the output of one transformation is taken as the input to the next trans- 
formation. For example, a series of m transformations labeled as T; where 7 = 1,2,--- ,m 
may be applied sequentially onto a mathematical object O. This operation can be ex- 
pressed mathematically by the following composite transformation T;: 


POV 407, (0) (42) 


where the output of 7) in this notation is taken as the input to 7 and so forth until the 
output of T;,-1 is fed as an input to 7; in the end to produce the final output of T.. In 
such cases, the Jacobian of the product is the product of the Jacobians of the individual 
transformations of which the product is made, that is: 


, an a ES Oe (43) 


where J; in this notation is the Jacobian of the 7; transformation and J, is the Jacobian 
of Ti. 

The collection of all admissible coordinate transformations with non-vanishing Jacobian 
form a group. This means that they satisfy the properties of closure, associativity, iden- 
tity and inverse. Hence, any convenient coordinate system can be chosen as the point of 
entry since other systems can be reached, if needed, through the set of admissible trans- 
formations. This is one of the cornerstones of building invariant physical theories which 
are independent of the subjective choice of coordinate systems and reference frames. We 
remark that transformation of coordinates is not a commutative operation and hence the 
result of two successive transformations may depend on the order of these transforma- 
tions. This is demonstrated in Figure 5 where the composition of two rotations results in 
different outcomes depending on the order of the rotations. 

As there are essentially two different types of basis vectors, namely tangent vectors of 
covariant nature and gradient vectors of contravariant nature (see § 2.6 and 3.1.1), there 
are two main types of non-scalar tensors: contravariant tensors and covariant tensors 
which are based on the type of the employed basis vectors of the given coordinate system. 
Tensors of mixed type employ in their definition mixed basis vectors of the opposite type 
to the corresponding indices of their components. As we will see, the transformation 
between these different types is facilitated by the metric tensor of the given coordinate 
system (refer to § 4.5). 

In the following subsections, we briefly describe a number of types and categories of 
coordinate transformations. 


2.3.1 Proper and Improper Transformations 


Coordinate transformations are described as “proper” when they preserve the handedness 
(right- or left-handed) of the coordinate system and “improper” when they reverse the 
handedness. Improper transformations involve an odd number of coordinate axes inver- 
sions in the origin of coordinates. Inversion of axes may be called improper rotation while 


2.3.2 Active and Passive Transformations 33 


ay Y v & 


Figure 5: The composite transformations R,R, (top) and R,R, (bottom) where R, and 
R, represent clockwise rotation of > around the positive x and y axes respectively. 


ordinary rotation is described as proper rotation. Figure 6 illustrates proper and improper 
coordinate transformations of a rectangular Cartesian coordinate system in a 3D space. 


2.3.2 Active and Passive Transformations 


Transformations can be active, when they change the state of the observed object such as 
rotating the object in the space, or passive when they are based on keeping the state of 
the object and changing the state of the coordinate system which the object is observed 
from. In brief, the subject of an active transformation is the object while the subject of a 
passive transformation is the coordinate system. 


2.3.3. Orthogonal Transformations 


An orthogonal coordinate transformation consists of a combination of translation, rota- 
tion and reflection of axes. The Jacobian of orthogonal transformations is unity, that is 
J = +1. The orthogonal transformation is described as positive iff J = +1 and neg- 


191 This condition should apply even when the transformation includes a translation since the added con- 
stants that represent the translation in the transformation equations will vanish in the Jacobian matrix. 
However, there seems to be a different convention that excludes translation to be part of orthogonal 
transformations. There also seems to be another convention which restricts orthogonal transformations 
to translation and rotation. 
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Improper 


Figure 6: Proper and improper transformations of a rectangular Cartesian coordinate 
system in a 3D space where the former is achieved by a rotation of the coordinate system 
while the latter is achieved by a rotation followed by a reflection of the first axis in the 
origin of coordinates. The transformed systems are shown as dashed and labeled with 
upper case letters while the original system is shown as solid and labeled with lower case 
letters. 


ative iff J = —1. Positive orthogonal transformations consist solely of translation and 
rotation (possibly trivial ones as in the case of the identity transformation) while neg- 
ative orthogonal transformations include reflection, by applying an odd number of axes 
reversal, as well. Positive transformations can be decomposed into an infinite number of 
continuously varying infinitesimal positive transformations each one of which imitates an 
identity transformation. Such a decomposition is not possible in the case of negative or- 
thogonal transformations because the shift from the identity transformation to reflection 
is impossible by a continuous process. 


2.3.4 Linear and Nonlinear Transformations 


The characteristic property of linear transformations is that they maintain scalar multi- 
plication and algebraic addition, while nonlinear transformations do not. Hence, if T is a 
linear transformation then we have: 


T (aA +bB) =aT(A)£bT (B) (44) 


where A and B are mathematical objects to be transformed by T and a and 0 are scalars. 
As indicated earlier, an admissible coordinate transformation from a rectilinear system 
defines another rectilinear system if the transformation is linear, and defines a curvilinear 
system if the transformation is nonlinear. 
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2.4 Coordinate Curves and Coordinate Surfaces 


As seen earlier, coordinate systems of 3D spaces are characterized by having coordinate 
curves and coordinate surfaces where the coordinate curves represent the curves of mutual 
intersection of the coordinate surfaces.!!°| These coordinate curves and coordinate surfaces 
play a crucial role in the formulation and development of the mathematical structures of 
the coordinated space. The coordinate curves represent the curves along which exactly one 
coordinate varies while the other coordinates are held constant. Conversely, the coordinate 
surfaces represent the surfaces over which all coordinates vary except one which is held 
constant. In brief, the i*” coordinate curve is the curve along which only the i*” coordinate 
varies while the i” coordinate surface is the surface over which only the i” coordinate is 
constant. 

For example, in a 3D Cartesian system identified by the coordinates (x,y,z) the curve 
r(a) = (x,¢c2,c3), where cg and cs are real constants, is an x coordinate curve since x 
varies while y and z are held constant, and the surface r(x, z) = (%, co, z) is a y coordinate 
surface since x and z vary while y is held constant. Similarly, in a cylindrical coordinate 
system identified by the coordinates (p,¢,z) the curve r(@) = (c1,¢,c¢3), where c, is 
a real constant, is a @ coordinate curve since p and z are held constant while @ varies. 
Likewise, in a spherical coordinate system identified by the coordinates (r, 6, ¢) the surface 
r(6,) = (c1,9,@) is an r coordinate surface since r is held constant while 6 and ¢ vary. 

As stated before, coordinate curves represent the curves of mutual intersection of coordi- 
nate surfaces. This should be obvious since along the intersection curve of two coordinate 
surfaces, where on each one of these surfaces one coordinate is held constant, two coor- 
dinates will be constant and hence only the third coordinate can vary. Hence, in a 3D 
Cartesian coordinate system the x coordinate curves occur at the intersection of the y 
and z coordinate surfaces, the y coordinate curves occur at the intersection of the x and 
z coordinate surfaces, and the z coordinate curves occur at the intersection of the x and 
y coordinate surfaces (refer to Figure 7). Similarly, in a cylindrical coordinate system the 
p, @ and z coordinate curves occur at the intersection of the ¢,z, the p,z and the p,@ 
coordinate surfaces respectively (refer to Figure 8). Likewise, in a spherical coordinate 
system the r, 8 and @ coordinate curves occur at the intersection of the 6, ¢, the r,@ and 
the r,@ coordinate surfaces respectively (refer to Figure 9). 

In this context, we note that the three types of coordinate surface of a Cartesian sys- 
tem, or in fact any rectilinear coordinate system (see § 2.2.1), are planes and hence the 
three types of coordinate curves are straight lines (refer to Figure 7). As for cylindrical 
coordinate systems, the p coordinate surfaces are cylinders (and this maybe the reason 
behind labeling them as “cylindrical’), the ¢ coordinate surfaces are semi-planes and the z 
coordinate surfaces are planes. Hence, the p, ¢@ and z coordinate curves are straight lines, 
circles, and straight lines respectively (refer to Figure 8). Regarding spherical coordinate 
systems, the r coordinate surfaces are spheres, the 6 coordinate surfaces are cones and 
the @ coordinate surfaces are semi-planes. Hence, the r, 6 and @ coordinate curves are 


[101 Qur focus here is 3D spaces although these concepts can be generalized to nD spaces as indicated 
earlier. 
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Figure 7: Coordinate curves (CC) and coordinate surfaces (CS) in a 3D Cartesian coordi- 
nate system. 


straight lines, semi-circles, and circles respectively (refer to Figure 9). 

The coordinate surfaces of general curvilinear coordinate systems can vary in general and 
hence they are not represented by particular geometric shapes such as planes and spheres. 
As a result, the coordinate curves of these systems are general space curves which may not 
have regular geometric shapes such as straight line or circle. As stated earlier, orthogonal 
coordinate systems are characterized by having coordinate surfaces which are mutually 
orthogonal at each point of the space, and consequently their coordinate curves are also 
mutually orthogonal at each point of the space. Finally, we remark that the transformation 
relations of Eq. 37 are used in defining the set of coordinate surfaces and coordinate curves. 


2.5 Scale Factors 


Scale factors of a coordinate system are those factors which are required to multiply the 
coordinate differentials to obtain the distances traversed during a change in the coordinate 
of that magnitude. The scale factors are symbolized with hi, h2,...,h» where this nota- 
tion is usually used in orthogonal coordinate systems. For example, in the plane polar 
coordinate system represented by the coordinates (p,@), the scale factor of the second 
coordinate ¢ is p because p is the factor used to multiply the differential of the polar angle 
dé to obtain the distance L traversed by a change of magnitude d@ in the polar angle, 
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(b) @ coordinate curve with p and z coordinate surfaces 
z 


c 
(c) z coordinate curve with p and ¢ coordinate surfaces 


Figure 8: Coordinate curves (CC) and coordinate surfaces(CS) in cylindrical coordinate 
systems. 
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(c) @ coordinate curve with r and @ coordinate surfaces 


Figure 9: Coordinate curves (CC) and coordinate surfaces (CS) in spherical coordinate 
systems. 
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Table 1: The scale factors (h1,h2,h3) for the three most commonly used orthogonal co- 
ordinate systems in 3D spaces: orthonormal Cartesian, cylindrical and spherical. The 
squares of these entries and the reciprocals of these squares give the diagonal elements of 
the covariant and contravariant metric tensors, g;; and g’, respectively of these systems 
(see Eqs. 238-240). 


Cartesian (2, y, z) Cylindrical (p, ¢, z) Spherical (r, 0, ¢) 
hy 1 1 1 
hg il p if 
hs 1 1 r sind 


that is: L = pd. 

The scale factors are also used for other purposes such as normalizing the basis vectors 
and defining the components in the physical representation of vectors and tensors (refer 
to § 2.6 and 3.3). They are also used in the analytical expressions for length, area and 
volume in orthogonal coordinate systems, as described in § 4.6.8, 4.6.9 and 4.6.10. The 
scale factors for the Cartesian, cylindrical and spherical coordinate systems of 3D spaces 
are given in Table 1. 


2.6 Basis Vectors and Their Relation to the Metric and Jacobian 


The set of basis vectors in a given manifold plays a pivotal role in the theoretical con- 
struction of the geometry of the manifold, where these vectors are used in the definition 
and construction of essential concepts and objects such as the metric tensor of the space. 
The basis sets are defined at each regular point in the space and hence the vectors in the 
basis sets vary in general from one point to another, i.e. they are coordinate dependent. 

The vectors providing the basis set for a coordinate system, which are not necessarily of 
unit length or mutually orthogonal, can be of covariant type or contravariant type. The 
covariant basis vectors are defined as the tangent vectors to the coordinate curves, while 
the contravariant basis vectors are defined as the gradient of the space coordinates and 
hence they are perpendicular to the coordinate surfaces (refer to Figure 10). Formally, the 
covariant and contravariant basis vectors are defined respectively by: 


Or i i 
where r is the position vector in Cartesian coordinates (z',...,2"), u’ represents general 
coordinates, n is the space dimension and 7 = 1,--- ,n. As indicated above, the covariant 


and contravariant basis sets, E; and E’, in general coordinate systems are functions of 
coordinates, that is: 


Ee Ee (Gyn 3 ") BSE (a) ait) (46) 


Hence, the definitions of Eq. 45 apply to each individual point in the space where the 
coordinate curves and coordinate surfaces belong to that particular point. For example, 
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at any particular point P in a 3D space with a valid coordinate system we have three mu- 
tually independent coordinate curves and three mutually independent coordinate surfaces 
and hence we should have three mutually independent covariant basis vectors and three 
mutually independent contravariant basis vectors at P. 


u> CC 


Figure 10: The covariant and contravariant basis vectors of a general curvilinear coordinate 
system and the associated coordinate curves (CC) and coordinate surfaces (CS) at a given 
point P in a 3D space. 


Like other vectors, the covariant and contravariant basis vectors of a given coordinate 
system are related to each other, through the metric tensor of the system, by the following 
relations: 

E; = gijE’ BE’ = gE; (47) 
where the metric tensor in its covariant form g;; and contravariant form g’’ works as an 
index shifting operator to lower and raise the indices and hence change the variance type 
of the basis vectors. Thus, the transformation between the covariant basis set and the 
contravariant basis set of a particular coordinate system is facilitated by the metric tensor 
of that system. 

The basis vectors in their covariant and contravariant forms are related to the components 
of the metric tensor in its covariant and contravariant forms by the following relations: 


E;- E; = gi E’ . EF’ = g” (48) 
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The covariant and contravariant basis vectors are reciprocal basis systems, and hence we 
have: 
E; -E’ = 6; KE’. BE; = 6%; (49) 
where the indexed 6 is the Kronecker delta tensor (refer to § 4.1). As we will see in § 4.5, 
the Kronecker delta tensor represents the metric tensor in its mixed form. 
In a 3D space with a right handed coordinate system the two sets of basis vectors are 
linked by the following relations: 


ae E? x E? aa E? x E! ee Et << E? (50) 
‘El. (BE? x B3)’ ° EI. (E? x B3)’ ° E!. (EB? x EB) 
E E E E E E 
E! = 2X BK E? = 3 X% By E: = 1X Ba (51) 
E, - (E2 x Es) E, - (E2 x Es) E, - (Ey x Es) 


The pattern in these relations is very simple, that is the covariant vectors are represented 
by contravariant vectors and vice versa; moreover, the indices in the numerators on both 
sides represent even permutations of 1, 2,3. 
The relations in Eqs. 50 and 51 may be expressed in a more compact form as follows: 
E’ x EX ' E; x Ex 


E; —= 5 = E’ _— 
Ei. (E) x E*) E; - (E; x Ex) 


(52) 


where 2,7,k take respectively the values 1,2,3 and the other two cyclic permutations 
(i.e. 2,3, 1 and 3,1,2). It is worth noting that the magnitude of the scalar triple product 
E;-(E; x E;,) represents the volume of the parallelepiped formed by the vectors E;, E; and 
E;, while the magnitude of the scalar triple product E’- (E/ x E*) represents its reciprocal. 
We also note that the scalar triple product is invariant to a cyclic permutation of the 
symbols of the three vectors involved, i.e. E;-(E; x E;,) = E,-(E; x E;) = E;- (Ex x E,); 
in fact its magnitude is invariant even to non-cyclic permutations. Another important note 
is that the magnitudes of the basis vectors in orthogonal coordinate systems are related 
to the scale factors of the coordinate system by: 


1 


ie (53) 


|E;| = h; |E'| = 
where h; is the scale factor for the i*” coordinate (see § 2.5). 
Following an admissible coordinate transformation between unbarred and barred gen- 
eral coordinate systems, the basis vectors in these systems are related by the following 
transformation rules: 


Ow — - Oui 
E; = Bul Bs EK; = Bai (54) 
Ei = oe E/ E’ = on E/ (55) 


where the indexed u and wu represent the coordinates in the unbarred and barred systems 
respectively. The transformation rules for the components can be easily deduced from 


2.6 Basis Vectors and Their Relation to the Metric and Jacobian 42 


the above rules. For example, for a vector A which can be represented covariantly and 
contravariantly in the unbarred and barred systems as: 


A =E'A; = E'A; (56) 


the transformation equations of its components between the two systems are given respec- 
tively by: 


Ou! - - Ou 
eae A= GaAs 68) 
oz aa o ae as ee) 


These transformation rules can be easily extended to higher rank tensors of different 
variance types (see § 3.1.1). 
For a 3D manifold with a right handed coordinate system, we have: 


EF, - (Ey x E3) = 9 E? . (E? x E®) = wa (60) 


where g is the determinant of the covariant metric tensor, i.e. 


g = det (9:3) = | 95] (61) 
Because E; - E; = gi; (Eq. 48) we have:!!!) 
JTS = [9:3] (62) 


where J is the Jacobian matrix that transforms between Cartesian and general coordinates, 
the superscript T’ represents matrix transposition, [g;;| is the matrix representing the 
covariant metric tensor and the product on the left is a matrix product as defined in linear 
algebra which is equivalent to a dot product in tensor algebra. On taking the determinant 
of both sides of Eq. 62, the relation between the Jacobian and the determinant of the 
metric tensor is obtained, that is: 


g= J? (63) 


where J is the Jacobian of the transformation. As it should be known, the determinant 
of a product is equal to the product of the determinants, that is: 


det(J7J) = det(J7 )det(J) (64) 
[11] For example, for the transformation between Cartesian and general coordinate systems in 3D spaces 
we have: 
Oct Oa? Oa* Oat = Ow Bx" 
dul dul dul Out Ou2 dud 
ji = Oat Aa? Aa? ee Oa? = Ax? ~— Aa? 
Ou2 Ou? Our? Oui Ou? Ou 
Oct Oa? a? Ox Ox Bax? 
due Ous— Ou38 Out Ou? Ou 


The entries of the product of these matrices then correspond to the entries of the metric tensor as 
given by Eq. 216. 
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Moreover, the determinant of a matrix is equal to the determinant of its transpose, that 
is: 


det(J”) = det(J) = J (65) 


As explained earlier, in orthogonal coordinate systems the covariant and contravariant 
basis vectors, E; and E’, at any point of the space are in the same direction, and hence the 
normalization of each one of these basis sets, by dividing each basis vector by its magnitude, 
produces identical orthonormal basis sets. Consequently, there is no difference between 
the covariant and contravariant components of tensors with respect to such contravariant 
and covariant orthonormal basis sets. This, however, is not true in general coordinate 
systems where each normalized basis set is different in general from the other. 

When the covariant basis vectors E; are mutually orthogonal at each point of the space, 
the following consequences will follow: 

1. The contravariant basis vectors E* are also mutually orthogonal because the corre- 
sponding vectors of each basis set are in the same direction due to the fact that the 
tangent vector to the i” coordinate curve and the gradient vector of the i” coordi- 
nate surface at a given point in the space have the same direction. This may also be 
established more formally by Eq. 49 since the reciprocal of an orthogonal system of 
vectors should also be orthogonal. Eqs. 50 and 51 may also be used in this argument 
since the cross products in the numerators are orthogonal to the multiplicands. 

2. The covariant and contravariant metric tensors, g;; and g”, are diagonal with non- 
vanishing diagonal elements, that is:!"?! 


Gig = 0 g’ =0 (#3) (66) 
gi ZO g' #0 (no sum on 7) (67) 


This can be concluded from Eq. 48 since the dot product is zero when the indices are 
different due to orthogonality. Also, the dot product should be non-zero when the 
indices are identical because the basis vectors cannot vanish at the regular points of 
the space since the tangent to the coordinate curve and the gradient to the coordinate 
surface do exist and they cannot be zero. Moreover, the metric tensor (as we will see 
in § 4.5) is invertible and hence it does not vanish at any point in the space. This 
guarantees that none of the diagonal elements can be zero when the metric tensor 
is diagonal since the determinant, which is formed by the product of its diagonal 
elements, should not vanish. 

3. The diagonal elements of the covariant and contravariant metric tensors are recip- 
rocals, that is: 


g=— (no summation) (68) 


This is a result from the previous point (i.e. the metric tensor is diagonal with non- 
vanishing diagonal elements) since the covariant and contravariant metric tensors 


[12] This also applies to the mixed type metric tensor since it is the identity tensor according to Eq. 49 
although this is not restricted to orthogonal coordinate systems. 
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are inverses of each other. As it is well known from linear algebra, the inverse of a 
diagonal matrix with non-vanishing diagonal entries is a diagonal matrix obtained 
by taking the reciprocals of its diagonal elements. 

4. The magnitudes of the contravariant and covariant basis vectors are reciprocals, that 
is: 


1 1 
E‘} = — = — 69 
l= Ty i oa 
This result is based on the definition of vector modulus plus Eqs. 48, 68 and 53, 


that is: 


|E'| = VE’. E (Eq. 263) (70) 


= V9 (Eq. 48) 
1 

se (Eq. 68) 

1 

== (Eq. 48) 
ib 

= iE, (Eq. 264) 
ib 

=; (Eq. 53) 


with no sum over 7. 
The consequences that we stated in the above bullet points also follow when the con- 
travariant basis vectors E’ are mutually orthogonal with minor amendments required in 
the phrasing of the first point. 


2.7 Relationship between Space, Coordinates and Metric 


In this section, we try to shed some light on the relationship between space, coordinate 
system and metric tensor where in this discussion we need the concepts of basis vectors 
and coordinate curves and coordinate surfaces (see § 2.4 and 2.6).!'5 We start from the 
very abstract concept of space (say nD space) which may be visualized as a sort of vacuum 
in which mathematical objects can exist. The number n represents the dimensionality of 
the space which is a quantitative measure of the complexity of the space structure where 
an nD space requires exactly n mutually independent variables to be fully and uniquely 
described. So, any point in this nD space can be uniquely located by assigning an ordered 
set of n mutually independent variables which we call “coordinates” such as ut,--+ ,u”. 
These coordinates vary continuously throughout the space where each coordinate ranges 
over a given real interval which may be finite or infinite. All the points of the space are 
then uniquely identified by varying these variables over their predefined ranges. The way 


(131 The revelation in this section is highly informal and non-rigorous with a pedagogical objective of 
providing a qualitative appreciation of the relation between space, coordinate system and metric tensor 
which may be useful for some novice readers. 
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by which the points of space are identified by the ordered sets of n coordinates may be 
labeled generically as a “coordinate system” for the space. 

The condition of mutual independence of the coordinate variables is important to make 
the distinction between the points unique and universal and to avoid any ambiguity. Mu- 
tual independence requires that an nD space is identified by a coordinate system with 
exactly n variable coordinates. As well as the condition of mutual independence, there is 
another important condition for the coordinate system to be acceptable that is it should 
be universal in its ability to identify all the points of the space unambiguously, and hence 
the coordinate system should be thorough in its functionality with possible exclusion of a 
finite number of isolated points in the space. The “coordinate system” may then be defined 
more rigorously as a bijective correspondence between the set of n-tuple numbers and the 
points of the space. 

For the coordinate system to be of any use, the way by which the coordinates are assigned 
to the individual points should be clearly defined. So, each particular way of defining the 
aforementioned bijective correspondence represents a specific coordinate system associated 
with the space. In fact, there are infinitely many ways of choosing the coordinate variables 
and how they vary in the space, and hence there are infinitely many coordinate systems 
that can be chosen to fulfill the above objectives. However, the conditions of mutual 
independence of coordinates and bijectivity of mapping impose certain restrictions on the 
selection of coordinates and how they can vary to achieve the intended functionalities. 

As a result of the above conditions and restrictions on the coordinate variables and 
how they vary to ensure their mutual independence and universal validity at each point 
in the space, there are only certain allowed ways for the choice of the coordinate curves 
and coordinate surfaces. So, on excluding certain types of coordinate systems which do 
not meet these requirements, and hence excluding certain types of how the coordinate 
curves and surfaces are defined at the individual points of the space, we are left with only 
certain acceptable possibilities for identifying how the coordinates vary and hence how the 
coordinate curves and surfaces are identified. 

Now, since the covariant basis vectors are defined as the tangents to the coordinate 
curves and the contravariant basis vectors are defined as the gradients to the coordinate 
surfaces, then to each one of those acceptable possibilities there is only one possibility for 
the identification of the covariant and contravariant sets of basis vectors. As soon as the 
two sets of basis vectors of the selected coordinate system are identified unambiguously, 
the metric tensor in its covariant and contravariant forms is uniquely identified throughout 
the space through its relation to the basis vectors, i.e. Eq. 48, and hence the vagueness 
about the nature of the space, its geometry and how it should be described is removed 
thanks to the existence of a well defined metric tensor. 

In brief, we start by defining a coordinate system which leads to the definition of coor- 
dinate curves and coordinate surfaces (and hence covariant and contravariant basis vector 
sets) and this in its turn will lead to the definition of the metric tensor. In fact, this 
approach may be described as procedural since it is based on the practical procedure of 
defining these concepts. A more fundamental approach may start from the metric of the 
space as an essential property of the space that characterizes its nature from which the 
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basis vectors and the coordinate system are to be derived. It should be remarked that 
the coordinate system and the space metric are independent entities despite the close link 
between the two. The metric can be regarded as a built-in attribute that characterizes the 
space but it can be demonstrated in different forms depending on the employed coordinate 
system. 

In this context, it is important to distinguish between the intrinsic and extrinsic prop- 
erties of an nD space where the intrinsic properties refer to those properties of the space 
which are seen from inside the space by an inhabitant of the space who can perceive only 
the n dimensions of the space, while the extrinsic properties refer to those properties of 
the space which are seen from outside the space by an inhabitant of an mD space (m > n) 
that encloses the nD space where this mD inhabitant can perceive the n dimensions of the 
space plus the extra (m — n) dimensions of the mD space. While the intrinsic properties 
of the nD space are described and quantified by the metric of the nD space, the extrinsic 
properties of the space are described and quantified by the metric of the embedding mD 
space since the nD space is a subspace of the mD space and hence it is subject to the mD 
metric. An example of nD and mD spaces is a 2D surface, such as a plane or a sphere, 
which is embedded in a 3D Euclidean space. 


2.8 Exercises and Revision 


2.1 Give brief definitions to the following terms: Riemannian space, coordinate system 
and metric tensor. 

2.2 Discuss the main functions of the metric tensor in a given space. How many types 
the metric tensor can have? 

2.3 What is the meaning of “flat” and “curved” space? Give mathematical conditions for 
the space to be flat in terms of the length of an infinitesimal element of arc and in 
terms of the metric tensor. Why these conditions should be global for the space to be 
flat? 

2.4 Give common examples of flat and curved spaces of different dimensions justifying in 
each case why the space is flat or curved. 

2.5 Explain why all 1D spaces are Euclidean. 

2.6 Give examples of spaces with constant curvature and spaces with variable curvature. 

2.7 State Schur theorem outlining its significance. 

2.8 What is the condition for a space to be intrinsically flat and extrinsically flat? 

2.9 What is the common method of investigating the Riemannian geometry of a curved 
manifold? 

2.10 Give brief definitions to coordinate curves and coordinate surfaces outlining their 
relations to the basis vector sets. How many independent coordinate curves and 
coordinate surfaces we have at each point of a 3D space with a valid coordinate 
system? 

2.11 Why a coordinate system is needed in tensor formulations? 

2.12 List the main types of coordinate system outlining their relations to each other. 

2.13 “The coordinates of a system can have the same physical dimension or different phys- 
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2.14 
2.15 
Z16 
a 
2.18 
2.19 
2.20 
DPA 
222 
2.23 
2.24 
2.25 
2.26 
226 
228 
2.20 
2.30 


2.31 


2.32 


2.33 


2.34 
2.35 
2.36 
2.37 


2.38 


ical dimensions”. Give an example for each. 

Prove that spherical coordinate systems are orthogonal. 

What is the difference between rectilinear and curvilinear coordinate systems? 

Give examples of common curvilinear coordinate systems explaining why they are 
curvilinear. 

Give an example of a commonly used curvilinear coordinate system with some of its 
coordinate curves being straight lines. 

Define briefly the terms “orthogonal” and “homogeneous” coordinate system. 

Give examples of rectilinear and curvilinear orthogonal coordinate systems. 

What is the condition of a coordinate system to be orthogonal in terms of the form 
of its metric tensor? Explain why this is so. 

What is the mathematical condition for a coordinate system to be homogeneous? 
How can we homogenize a non-homogeneous coordinate system of a flat space? 

Give examples of homogeneous and non-homogeneous coordinate systems. 

Give an example of a non-homogeneous coordinate system that can be homogenized. 
Describe briefly the transformation of spaces and coordinate systems stating relevant 
mathematical relations. 

What “injective transformation” means? Is it necessary that such a transformation 
has an inverse? 

Write the Jacobian matrix J of a transformation between two nD spaces whose coor- 
dinates are labeled as u’ and w’ where i = 1,--- ,n. 

State the pattern of the row and column indices of the Jacobian matrix in relation to 
the indices of the coordinates of the two spaces. 

What is the difference between the Jacobian matrix and the Jacobian and what is the 
relation between them? 

What is the relation between the Jacobian of a given transformation and the Jacobian 
of its inverse? Write a mathematical formula representing this relation. 

Is the labeling of two coordinate systems (e.g. barred and unbarred) involved in a 
transformation relation essential or arbitrary? Hence, discuss if the labeling of the 
coordinates in the Jacobian matrix can be interchanged. 

Using the transformation equations between the Cartesian and cylindrical coordinate 
systems, find the Jacobian matrix of the transformation between these systems, i.e. 
Cartesian to cylindrical and cylindrical to Cartesian. 

Repeat question 2.32 for the spherical, instead of cylindrical, system to find the Ja- 
cobian this time. 

Give a simple definition of admissible coordinate transformation. 

What is the meaning of the C” continuity condition? 

What “invariant” object or property means? Give some illustrating examples. 

What is the meaning of “composition of transformations”? State a mathematical 
relation representing such a composition. 

What is the Jacobian of a composite transformation in terms of the Jacobians of the 
simple transformations that make the composite transformation? Write a mathemat- 
ical relation that links all these Jacobians. 
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2.39 


2.40 


2.41 


2.42 


2.43 


2.44 
2.45 
2.46 
2.47 
2.48 
2.49 
2.50 


2.51 


2.52 
2.53 


2.54 


2.59 


2.56 


2.07 


2.58 


“The collection of all admissible coordinate transformations with non-vanishing Ja- 
cobian form a group”. What this means? State your answer in mathematical and 
descriptive forms. 

Is the transformation of coordinates a commutative operation? Justify your answer 
by an example. 

A transformation 73 with a Jacobian J3 is a composite transformation, i.e. T3 = 7>T; 
where the transformations TJ) and Ty have Jacobians J; and Jy. What is the relation 
between J;, Jo and J3? 

Two transformations, R,; and Ro, are related by: R, Ry = I where I is the identity 
transformation. What is the relation between the Jacobians of R, and Ry? What we 
should call these transformations? 

Discuss the transformation of one set of basis vectors of a given coordinate system to 
another set of opposite variance type of that system and the relation of this to the 
metric tensor. 

Discuss the transformation of one set of basis vectors of a given coordinate system to 
another set of the same variance type of another coordinate system. 

Discuss and compare the results of question 2.43 and question 2.44. Also, compare 
the mathematical formulation that should apply in each case. 

Define proper and improper coordinate transformations. 

What is the difference between positive and negative orthogonal transformations? 
Give detailed definitions of coordinate curves and coordinate surfaces of 3D spaces 
discussing the relation between them. 

For each one of the following coordinate systems, what is the shape of the coordinate 
curves and coordinate surfaces: Cartesian, cylindrical and spherical? 

Make a simple plot representing the ¢ coordinate curve with the p and z coordinate 
surfaces of a cylindrical coordinate system. 

Make a simple plot representing the r coordinate curve with the @ and @ coordinate 
surfaces of a spherical coordinate system. 

Define “scale factors” of a coordinate system and outline their significance. 

Give the scale factors of the following coordinate systems: orthonormal Cartesian, 
cylindrical and spherical. 

Define, mathematically and in words, the covariant and contravariant basis vector 
sets explaining any symbols involved in these definitions. 

What is the relation of the covariant and contravariant basis vector sets with the 
coordinate curves and coordinate surfaces of a given coordinate system? Make a 
simple sketch representing this relation for a general curvilinear coordinate system in 
a 3D space. 

The covariant and contravariant components of vectors can be transformed one to the 
other. How? State your answer in a mathematical form. 

What is the significance of the following relations? 


E; -E’ = 6; E' -E; = 0°, 


Write down the mathematical relations that correlate the basis vectors to the compo- 
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2.59 


2.60 


2.61 


2.62 


2.63 


2.64 


nents of the metric tensor in their covariant and contravariant forms. 
Using Eq. 50, show that if E!, E?, E® is a right handed orthonormal system then E; = 
E’. Repeat the question using this time Eq. 51 where E,, E2,E3 form a right handed 
orthonormal system. Hence, conclude that when the covariant or contravariant basis 
vector set is orthonormal then the covariant and contravariant components of a given 
tensor are identical. 
State the mathematical relations between the original and transformed (i.e. unbarred 
and barred) basis vector sets in their covariant and contravariant forms under admis- 
sible coordinate transformations. 
Correct, if necessary, the following equations explaining all the symbols involved: 
1 1 2 3 
E, - (Ey x Es) a E (E x E*) V9 


Obtain the relation: g = J? from the relation: J7J = [g;;] giving full explanation of 
each step. 

State three consequences of having mutually orthogonal contravariant basis vectors 
at each point in the space justifying these consequences. 

Discuss the relationship between the concepts of space, coordinate system and metric 
tensor. 


Chapter 3 
Tensors 


The subject of this chapter is tensors in general. The chapter is divided into three sections 
where we discuss in these sections: tensor types, tensor operations, and tensor represen- 
tations. 


3.1 Tensor Types 


In the following subsections, we introduce a number of tensor types and categories and 
highlight their main characteristics and differences. These types and categories are not 
mutually exclusive and hence they overlap in general. Moreover, they may not be exhaus- 
tive in their classes as some tensors may not instantiate any one of a complementary set 
of types such as being symmetric or anti-symmetric. 


3.1.1 Covariant and Contravariant Tensors 


These are the main types of tensor with regard to the rules of their transformation between 
different coordinate bases. Covariant tensors are notated with subscript indices (e.g. Aj) 
while contravariant tensors are notated with superscript indices (e.g. A”). A covariant 
tensor is transformed according to the following rule: 


- Oui 
Oui J ( ) 
while a contravariant tensor is transformed according to the following rule: 
_ Oy. 
t#_ As 72 
Fal (72) 


The barred and unbarred symbols in these equations represent the same mathematical 
object (tensor or coordinate) in the transformed and original coordinate systems respec- 
tively. An example of covariant tensors is the gradient of a scalar field while an example 
of contravariant tensors is the displacement vector. 

The above transformation rules, as demonstrated by Eqs. 71 and 72, which correspond 
to rank-1 tensors can be easily extended to tensors of any rank following the above pattern. 
Hence, a covariant tensor A of rank m is transformed according to the following rule: 

& Ou? Out Ou" 

Asjom = dui Oui am Pr (73) 
Similarly, a contravariant tensor B of rank n is transformed according to the following 
rule: 


Bgoac, (OU OU OU ic. 
is du dul Ou" oe “ 
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Some tensors of rank > 1 have mixed variance type, i.e. they are covariant in some 
indices and contravariant in others. In this case the covariant variables are indexed with 
subscripts while the contravariant variables are indexed with superscripts, e.g. Al which 
is covariant in 7 and contravariant in 7. A mixed type tensor transforms covariantly in its 
covariant indices and contravariantly in its contravariant indices, e.g. 

jin _ OU dw OW 4, 4 
eS  OUEOU OU A 

In brief, tensors of rank-n (n > 0) whose all free indices are subscripts are covariant, 
and tensors of rank-n (n > 0) whose all free indices are superscripts are contravariant, 
while tensors of rank-n (n > 1) whose some of free indices are subscripts while other 
free indices are superscripts are mixed. Hence, A; and B,; are covariant and transform 
according to Eq. 73, A’ and B’* are contravariant and transform according to Eq. 74, 
and A? and B!* are mixed and transform according to Eq. 75. As indicated before, for 
orthonormal Cartesian coordinate systems there is no difference between the covariant and 
contravariant representations, and hence the indices can be upper or lower although it is 
common to use lower indices in this case. 

The practical rules for writing the transformation equations, as seen in the above equa- 
tions, can be summarized in the following points where we take Eq. 75 (which is sufficiently 
general) as an example to illustrate the steps taken in writing the transformation equa- 
tions: 

1. We start by writing the symbol of the transformed tensor on the left hand side of 
the transformation equation and the symbol of the original tensor on the right hand 
side, that is: 


(75) 


A=A (76) 
2. We index the transformed tensor according to its intended indicial structure observ- 
ing the order of the indices. We similarly index the original tensor with its original 
indices, that is: 
Al n__ gik 
An =A; (77) 
3. We insert a number of partial differential operators on the right hand side equal to 
the number of free indices, that is: 


= Ou Ou Ou ,. 
bn Vee Aik 
Am = Duduoue ! (78) 


4. We index the coordinates of the transformed tensor in the numerator or denominator 
in these operators according to the order of the indices in the tensor where these 
indices are in the same position (upper or lower) as their position in the tensor, that 
is: 


OW Ob OU 22% 


AOS AY 79 

a = Ou Ou On d 2 

5. Because the transformed tensor is barred then its coordinates should also be barred, 
that is: 2 : 


m ~~ du du™ Ou 4 
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6. We then index the coordinates of the original tensor in the numerator or denominator 
in these operators according to the order of the indices in the tensor where these 
indices are in the opposite position (upper or lower) to their position in the tensor, 
that is: “ oe 

qin Ou’ Ou? Ou" ik 

mm Ou Our” Our 4 

We note that an upper coordinate index in the denominator of the partial differential oper- 

ators is equivalent to a lower tensor index. We also note that each pair of the transformed 

indices are on the two sides and hence they should be in the same position (upper or lower) 

according to the rules of free indices (see § 1.2), while the pairs of indices of the original 

tensor are in the same side and hence they should be in opposite positions according to 

the rules of dummy indices. 

The covariant and contravariant types of a tensor are linked through the metric tensor. 

As will be detailed later (refer to § 4.5), the contravariant metric tensor is used for raising 
covariant indices of covariant and mixed tensors, e.g. 


(81) 


where the contravariant metric tensor g* is used to raise the covariant index k to be- 
come the contravariant index 7. Similarly, the covariant metric tensor is used for lowering 
contravariant indices of contravariant and mixed tensors, e.g. 


A; = Gin A® Aig = Gin A®, (83) 


where the covariant metric tensor gj, is used to lower the contravariant index k to become 
the covariant index 27. Hence, the metric tensor is used to change the variance type of 
tensor indices and for that reason it is labeled as index raising or index lowering or index 
shifting operator. We note that in the raising and lowering operations, the metric tensor 
acts as an index replacement operator, as well as a shifting operator, by changing the label 
of the shifted index, as seen in the above examples where the shifted index k is replaced 
with 7 which is the other index of the metric tensor. 

Because it is possible to shift the index position of a tensor by using the covariant and 
contravariant types of the metric tensor as an index shifting operator, a given tensor can 
be cast into a covariant or a contravariant form, as well as a mixed form in the case of 
tensors of rank > 1. However, it should be emphasized that the order of the indices must 
be respected in this process, because two tensors with the same indicial structure but with 
different indicial order are not equal in general, as stated before. For example: 


A‘, = gjn-A” - jr A™ = A;' (84) 


Dots may be inserted to remove any ambiguity about the order of the indices and hence 


A’, or A‘; means 7 first and 7 second while A; or Aj means 7 first and 7 second. 


A tensor of m free contravariant indices and n free covariant indices may be called type 
(m,n) tensor. When one or both variance types are absent, zero is used to refer to the 


absent variance type in this notation. Accordingly, A‘, is a type (1,2) tensor, B’ is a 


7] 
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type (2,0) tensor, Ci, is a type (0,1) tensor, and Df*, is a type (2,3) tensor. As we will 
see in § 3.1.3, the type may also include the weight as a third entry and hence the type 
in this sense is identified by the symbol (m,n, w) where m and n refer to the number of 
contravariant and covariant indices respectively while w refers to the weight of the tensor. 
It is obvious that the rank of the tensor can be obtained from this notation by adding the 
first two entries and hence a tensor of type (m,n) or (m,n, w) is of rank m+n. 

As seen earlier, the vectors providing the basis set for a coordinate system are of covariant 
type when they are tangent to the coordinate curves, which represent the curves along 
which exactly one coordinate varies while the other coordinates are held constant, and 
they are of contravariant type when they are perpendicular to the coordinate surfaces 
which are the surfaces over which all coordinates vary except one which is held constant. 
Formally, the covariant and contravariant basis vectors are given respectively by: 


Ei = Vu" (85) 


where r = x,e; is the position vector in Cartesian coordinates and u’ is a general co- 
ordinate. As indicated earlier, a superscript in the denominator of partial derivatives is 
equivalent to a subscript in the numerator and hence the above equation is consistent with 
the rules of tensor indices which were outlined in § 1.2. 

We also remark that in general the basis vectors, whether covariant or contravariant, are 
not necessarily of unit length and/or mutually orthogonal although they may be so. In 
fact, there are standard mathematical procedures to orthonormalize the basis set if it is 
not and if orthonormalization is needed. For example, the two covariant basis vectors, KE, 
and Ep, of a 2D space (i.e. surface) can be orthonormalized as follows: 

FE, FE, E. = giE2 — gi2ky 


E ae 
= |E,| JG a JING 


where g is the determinant of the surface covariant metric tensor, the indexed g are the 
coefficients of this tensor, and the underlined vectors are orthonormal basis vectors, that 
is: 


(86) 


E,-E, =1 E,- E, = 1 E, - E, =0 (87) 
This can be verified by conducting the dot products of the last equation using the vectors 
defined in Eq. 86. 


As seen earlier, the two sets of covariant and contravariant basis vectors are reciprocal 
systems and hence they satisfy the following reciprocity relations: 


E; - E! = 6; E'. Bj = 6%, (88) 


where the indexed 6 is the Kronecker delta tensor (refer to § 4.1) which can be represented 
by the unity matrix. The reciprocity of these two sets of basis vectors is illustrated 
schematically in Figure 11 for the case of a 2D space. 
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Figure 11: The reciprocity relation between the covariant and contravariant basis vectors 
in a 2D space where E; | E?, E' | Es, and |E,||E"| cos ¢ = |Es| |E*| cos ¢ = 1. 


A vector can be represented either by covariant components with contravariant basis 
vectors or by contravariant components with covariant basis vectors. For example, a 
vector A can be expressed as: 


A = A,E' or A = A'E; (89) 


where E’ and E; are the contravariant and covariant basis vectors respectively. This is 
illustrated graphically in Figure 12 for a vector A in a 2D space. The use of the covariant 
or contravariant form of the vector representation is a matter of choice and convenience 
since these two representations are equivalent as they represent and correctly describe the 
same object. 

More generally, a tensor of any rank (> 1) can be represented covariantly using con- 
travariant basis tensors of that rank, or contravariantly using covariant basis tensors, or 
in a mixed form using a mixed basis of opposite type. For example, a rank-2 tensor A can 
be written as: 

A = A,E'E! = AVE,E; = AJE'E; = A',E,E! (90) 
where E'E’, E;E;, E’E; and E;E/ are dyadic products of the basis vectors of the presumed 
system (refer to § 3.2.3). We remark that dyadic products represent a combination of two 
vectors and hence they represent two directions in a certain order. Figure 13 is a graphic 
illustration of the nine dyadic products of the three unit basis vectors in a 3D space with 
a rectangular Cartesian coordinate system. 

Similarly, a rank-3 tensor B can be written as: 


B = Bj,E'E/E* = BY*E,E;E, = B/"E'E;E, = B',*E,B’E, =---ete. (91) 
where E'E/E* ...etc. are triads. More generally, a rank-n tensor C can be written as: 


C = Cy..nE'E! ... E” = C¥ "EE, +--+ E, = C2" "EE, --- En = ++ etc. (92) 
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Figure 12: The representation of a vector A in covariant and contravariant basis vector 
sets in a 2D space where the components shown at the four points are with reference to 
unit vectors in the given directions, e.g. A'|E,| is a component with reference to a unit 
vector in the direction of Ey. 


where E’E’ -.-E”... etc. are n-polyads. We note that the order of the indices in the tensor 
bases (i.e. dyads, triads and n-polyads) should be the same as the order of the indices in 
the tensor components, as seen in the above examples. 

Finally, it should be remarked that the two sets of basis vectors (i.e. covariant and 
contravariant), like the components of the tensors themselves, are identical for orthonormal 
Cartesian systems. This can be explained by the fact that since the coordinate surfaces 
are mutually perpendicular planes then the coordinate curves are mutually perpendicular 
straight lines and hence the tangent vectors to the x’ coordinate curve is the same as the 
perpendicular vector to the x’ coordinate surface noting that the coordinates are scaled 
uniformly in all directions. 


3.1.2. True and Pseudo Tensors 


These are also called polar and axial tensors respectively although it is more common 
to use these terms for vectors. Pseudo tensors may also be called tensor densities, how- 
ever the terminology in this part, like many other parts, is not universal. True tensors 
are proper or ordinary tensors and hence they are invariant under coordinate transforma- 
tions, while pseudo tensors are not proper tensors since they do not transform invariantly 
as they acquire a minus sign under improper orthogonal transformations which involve 
inversion of coordinate axes through the origin of coordinates with a change of system 
handedness. Figure 14 demonstrates the behavior of a true vector v and a pseudo vector 
p where the former keeps its direction in the space following a reflection of the coordinate 
system through the origin of coordinates while the latter reverses its direction following 
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Figure 13: The nine unit dyads associated with the double directions of rank-2 tensors 
in a 3D space with a rectangular Cartesian coordinate system. The vectors e; and e; 
(i,j = 1,2,3) are unit vectors in the directions of coordinate axes where the first indexed 
e (blue) represents the first vector of the dyad while the second indexed e (red) represents 
the second vector of the dyad. In these nine frames, the first vector is fixed along each 
row while the second vector is fixed along each column. 


this operation. 

Because true and pseudo tensors have different mathematical properties and represent 
different types of physical entities, the terms of consistent tensor expressions and equations 
should be uniform in their true and pseudo type, i.e. all terms should be true or all terms 
should be pseudo. The direct product (refer to § 3.2.3) of true tensors is a true tensor. 
The direct product of even number of pseudo tensors is a true tensor, while the direct 
product of odd number of pseudo tensors is a pseudo tensor. The direct product of a mix 
of true and pseudo tensors is a true or pseudo tensor depending on the number of pseudo 
tensors involved in the product as being even or odd respectively. 

Similar rules to those of the direct product apply to the cross product, including the curl 
operation, involving tensors (which are usually of rank-1) with the addition of a pseudo 


3.1.3 Absolute and Relative Tensors 57 


X3 
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Figure 14: The behavior of a true vector (v and V) and a pseudo vector (p and P) on 
reflecting the coordinate system in the origin of coordinates. The lower case symbols stand 
for the objects in the original system while the upper case symbols stand for the same 
objects in the reflected system. 


factor for each cross product operation. This factor is contributed by the permutation 
tensor which is implicit in the definition of the cross product. As we will see in § 4.2, the 
permutation tensor is a pseudo tensor. 

In summary, what determines the tensor type (true or pseudo) of the tensor terms 
involving direct and cross products is the parity of the multiplicative factors of pseudo type 
plus the number of cross product operations involved since each cross product operation 
contributes a permutation tensor. We note that inner product (see § 3.2.5) is the result of 
a direct product (see § 3.2.3) operation followed by a contraction (see § 3.2.4) and hence 
it is like a direct product in this context. 


3.1.3. Absolute and Relative Tensors 


Considering an arbitrary transformation from a general coordinate system to another, a 

tensor of weight w is defined by the following general tensor transformation relation: 

© Ou Oud = OU Out ue Ou 
outou" (Out onl da” du" 


aii Ou Beck 
Ar = | A a (93) 


Im...n ~ Ou 


where || symbolizes the Jacobian of the transformation between the two systems (see § 
2.3). When w = 0 the tensor is described as an absolute or true tensor, and when w 4 0 
the tensor is described as a relative tensor. When w = —1 the tensor may be described as a 
pseudo tensor, while when w = 1 the tensor may be described as a tensor density. We note 
that some of these labels are used differently by different authors as the terminology of 
tensor calculus is not universally approved and hence the conventions of each author should 
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be checked. Also, there is an obvious overlap between this classification (i.e. absolute and 
relative) and the previous classification (i.e. true and pseudo) at least according to some 
conventions. As indicated earlier (see § 3.1.1), a tensor of m contravariant indices and n 
covariant indices may be described as a tensor of type (m,n). This may be extended to 
include the weight w as a third entry and hence the type of the tensor is identified by 
(m,n, w). 

Relative tensors can be added and subtracted (see § 3.2.1) if they have the same indicial 
structure and have the same weight; the result is a tensor of the same indicial structure 
and weight. Also, relative tensors can be equated if they have the same indicial structure 
and weight. In brief, the terms of tensor expressions and equalities should have the same 
weight w, whether w is equal to zero or not. Multiplication of relative tensors produces 
a relative tensor whose weight is the sum of the weights of the original tensors. Hence, if 
the weights are added up to a non-zero value the result is a relative tensor of that weight; 
otherwise it is an absolute tensor. 

We remark that the statements of the previous paragraph can be generalized by including 
w = 0 which corresponds to absolute tensors and hence “relative” in those statements is 
more general than being opposite to “absolute”. Accordingly, and from the perspective 
of relative tensors (i.e. assuming that other qualifications such as matching in indicial 
structure, are met), two absolute tensors can be added or subtracted or equated but an 
absolute and a relative tensor (i.e. with w 4 0) cannot since they are “relative” tensors 
with different weights. 


3.1.4 Isotropic and Anisotropic Tensors 


Isotropic tensors are characterized by the property that the values of their components 
are invariant under coordinate transformation by proper rotation of axes. In contrast, the 
values of the components of anisotropic tensors are dependent on the orientation of the 
coordinate axes. Notable examples of isotropic tensors are scalars (rank-0), the vector 
O (rank-1), Kronecker delta (rank-2) and Levi-Civita tensor (rank > 2). Many tensors 
describing physical properties of materials, such as stress and magnetic susceptibility, are 
anisotropic. 

Direct and inner products (see § 3.2.3 and 3.2.5) of isotropic tensors are isotropic tensors. 
The zero tensor of any rank and any dimension is isotropic; therefore if the components 
of a tensor vanish in a particular coordinate system they will vanish in all properly and 
improperly rotated coordinate systems.!41 Consequently, if the components of two tensors 
are identical in a particular coordinate system they are identical in all transformed coor- 
dinate systems since the tensor of their difference is a zero tensor and hence it is invariant. 
This means that tensor equalities and identities are invariant under coordinate transfor- 
mations, which is one of the main motivations for the use of tensors in mathematics and 
science. As indicated, all rank-0 tensors (scalars) are isotropic. Also, the zero vector, 0, 
of any dimension is isotropic; in fact it is the only rank-1 isotropic tensor. 


[14] For improper rotation, this is more general than being isotropic. 
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3.1.5 Symmetric and Anti-symmetric Tensors 


These types of tensor apply to high ranks only (rank > 2) since symmetry and anti- 
symmetry of tensors require in their definition two free indices at least, and hence a 
scalar with no index and a vector with a single index do not qualify to be symmetric or 
anti-symmetric. Moreover, these types are not exhaustive, even for tensors of rank > 2, as 
there are high-rank tensors which are neither symmetric nor anti-symmetric, and hence we 
may call them asymmetric tensors although this terminology may include anti-symmetric 
tensors as well. 

A rank-2 tensor A;; is symmetric in its components zff for all i and j the following 
condition is satisfied: 


Aji = Ai (94) 
and anti-symmetric or skew-symmetric iff for all 2 and 7 the following condition is satisfied: 


Similar conditions apply to contravariant type tensors, i.e. A’ = A” for symmetric and 
As’ = — A” for anti-symmetric. This also applies to the following definitions and identities 
which are largely presented in covariant forms. 

More generally, a rank-n tensor Aj, ._;,, is Symmetric in its two indices i; and 7, aff the 
following condition applies identically: 


A = Ain ij tpucin (96) 


beige cin 
and anti-symmetric in its two indices 7; and 7, iff the following condition applies identically: 
Aig. apts Gis = Fa ap hjetn (97) 


Any rank-2 tensor A;; can be synthesized from (or decomposed into) a symmetric part 
Aq;), which is marked with round brackets enclosing the indices, and an anti-symmetric 
part Ajj), which is marked with square brackets, where the following relations apply: 


Ay = Ay + Aggy (98) 
1 

Ay) = 5 (Au + An) (99) 
1 

Aug = 5 (Ais — Aus) (100) 


The first relation can be verified by substituting the second and third relations into the 
first, that is: 


1 1 1 

Ai 2 
9 (Au 2 2 
which is an identity, while the second and third relations can be verified by shifting the 
indices, that is: 


Aig = Aci) + Afig) = 


Aji) + 5 (Aig — Aji) = 5 (2Aiy) = Aig (101) 


1 1 
Agi = 5 (Aji + Aij) = 5 (Ai + Aji) = AGy (102) 
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1 
Ay = 5 (Aji — Aig) = —5 (Aig — Ans) = Agia (103) 


and hence A,;) is symmetric and Aj,) is anti-symmetric. 
Similarly, a rank-3 tensor A;;;, can be symmetrized by the following relation: 


1 
3! 


Again = = (Auge + Ang + Ager + Ang + Apa + Aan) (104) 


and anti-symmetrized by the following relation: 


1 
3! 


Aisa = = (Ane + Ang t Aiea — Awe Aga = Agj) (105) 


where the first three terms in these equations are the even permutations of the indices ijk 
and the last three terms are the odd permutations of these indices. These relations can 
also be verified by exchanging the indices. For example: 


1 
3! 


A(jik) = (Ajin + Angi + Ainy + Ajei + Aije + Any) = A (ijk) (106) 


and hence it is symmetric in i and 7. Similarly: 


1 
3 


Apjik) = (Ape Age Age Ape Age Ay) = Aga (107) 
and hence it is anti-symmetric in 7 and 7. The exchange of other indices can be done 
similarly and hence the symmetric nature of A(x) and the anti-symmetric nature of Ajjx] 
with respect to their other indices can also be verified. The symmetric and anti-symmetric 
parts, A(jz) and Ajjqj, are related by the following equations: 


3 (Acizry + Ape) = Aaje + Anig + Agni (108) 
3 (Aqjxy — Apjiay) = Aang + Ayan + Ange (109) 


which can also be verified by substitution, as done for rank-2. The right hand side of the 
first equation is the sum of the even permutations of the indices 77k while the right hand 
side of the second equation is the sum of the odd permutations of these indices. 

More generally, a rank-n tensor Aj;,..;,, can be symmetrized by: 


iL 
AG 5 = a (Sceven permutations of 7’s + S Vodd permutations of i's) (110) 
and anti-symmetrized by: 


1 
Afit..in) = i (Screven permutations of 7’s — S “odd permutations of i's) (111) 


Similar verifications and relations can be established for the rank-n case following the 
patterns seen in the above rank-2 and rank-3 cases. 
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A tensor of high rank (> 2) may be symmetrized or anti-symmetrized with respect to 
only some of its indices instead of all of its indices. For example, in the following the 
tensor A;,, is symmetrized and anti-symmetrized respectively with respect to its first two 
indices only, that is: 


1 

Aur = 5 (Ain + Ajiz) (112) 
1 

Ajjrk = 5 (Ave = Apa) (113) 


The symmetry of Aq; and the anti-symmetry of Ajj, with respect to the 7j indices can 
be verified by exchanging these indices in the above relations as done previously for A(;j) 
and Ajj. Moreover, the tensor Aj; can be expressed as the sum of these symmetric and 
anti-symmetric tensors, that is: 


1 1 
Asya + Aligie = 5 (Auge + Agi) + 5 (Aisne — Agin) = Auge (114) 
A tensor is described as totally symmetric ff it is symmetric with respect to all of its 


indices, that is: 


Aig ocin = Alit..tin) (115) 


and hence its anti-symmetric part is zero, i.e. Ajj,...;,) = 0. Similarly, a tensor is described 
as totally anti-symmetric iff it is anti-symmetric in all of its indices, that is: 


Age gS Ayes (116) 


and hence its symmetric part is zero, i.e. Aj,..i,,) = 0. 

For a totally anti-symmetric tensor, non-zero entries can occur only when all the indices 
are different. This is because an exchange of two identical indices, which identifies identical 
entries, should change the sign due to the anti-symmetry and no number can be equal to 
its negation except zero. Therefore, if the tensor A;,;, is totally anti-symmetric then only 
its entries of the form A;;,, where 1 #4 7 # k, are not identically zero while all the other 
entries (i.e. all those of the from Ajj;, Ajj, Aji and Aj; where i 4 j with no sum on 7) 
vanish identically. In fact, this also applies to partially anti-symmetric tensors where the 
entries corresponding to identical anti-symmetric indices should vanish identically. Hence, 
if Ajj, is anti-symmetric in its first two indices and in its last two indices then only its 
entries with i 4 7 and k #1 do not vanish identically while all its entries with 7 = 7 or 
k =I (or both) are identically zero. 

It should be remarked that the indices whose exchange defines the symmetry and anti- 
symmetry relations should be of the same variance type, i.e. both upper or both lower. 
Hence, a tensor Aj is not symmetric if the components satisfy the relation A a A‘ or 
anti-symmetric if the components satisfy the relation Aj = — Ai . This should be obvious 
by considering that the covariant and contravariant indices correspond to different basis 
sets (i.e. contravariant and covariant), as explained in § 3.1.1. 

Another important remark is that the symmetry and anti-symmetry characteristic of a 
tensor is invariant under coordinate transformations. Hence, a symmetric/anti-symmetric 
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tensor in one coordinate system is symmetric/anti-symmetric in all other coordinate sys- 
tems. This is because if we label the tensor as A and its transformation by symmetry 
or anti-symmetry as B, then an equality of the two tensors based on their symmetric or 
anti-symmetric nature (i.e. A;; = Bj; or Aj; = —B;;) can be expressed, using an algebraic 
transformation, as a zero tensor (i.e. Aj; — Bj; = 0 or Aj; + Bj; = 0) which is invariant 
under coordinate transformations as stated earlier (see § 3.1.4). Similarly, a tensor which 
is neither symmetric nor anti-symmetric in one coordinate system should remain so in all 
other coordinate systems obtained by permissible transformations. This is based on the 
previous statement because if it is symmetric or anti-symmetric in one coordinate sys- 
tem it should be symmetric or anti-symmetric in all other coordinate systems according 
to the previous statement. In brief, the characteristic of a tensor as being symmetric or 
anti-symmetric or neither is invariant under permissible coordinate transformations. 

Finally, for a symmetric tensor A;; and an anti-symmetric tensor B” (or the other way 
around) we have the following useful and widely used identity: 


A;;BY =0 (117) 


This is because an exchange of the 2 and j indices will change the sign of one tensor 
only, which is the anti-symmetric tensor, and this will change the sign of the term in the 
summation resulting in having a sum of terms which is identically zero due to the fact 
that each term in the sum has its own negation. This also includes the zero components 
of the anti-symmetric tensor where the terms containing these components are zero. 


3.1.6 General and Affine Tensors 


Affine tensors are tensors that correspond to admissible linear coordinate transformations 
(see § 2.3.4) from an original rectangular system of coordinates, while general tensors 
correspond to any type of admissible coordinate transformations. These categories are 
part of the terminology of tensor calculus which we use in this book and hence they do 
not have a particular significance. 


3.2 Tensor Operations 


In this section, we briefly examine the main elementary tensor operations, which are 
mostly of algebraic nature, that permeate tensor algebra and calculus. There are various 
operations that can be performed on tensors to produce other tensors in general. Examples 
of these operations are addition/subtraction, multiplication by a scalar (rank-0 tensor), 
multiplication of tensors (each of rank > 0), contraction and permutation. Some of these 
operations, such as addition and multiplication, involve more than one tensor while others, 
such as permutation, are performed on a single tensor. Contraction can involve one tensor 
or two tensors. 

Before we start our investigation, we should remark that the last subsection (i.e. § 
3.2.7), which is about the quotient rule of tensors, is added to the present section of 
tensor operations because this section is the most appropriate place for it in the present 
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book considering the dependency of the definition of this rule on other tensor operations; 
otherwise the subsection is not about a tensor operation in the same sense as the operations 
presented in the other subsections. Another remark is that in tensor algebra division is 
allowed only for scalars, and hence if the components of an indexed tensor should appear in 
a denominator, the tensor should be redefined to avoid this. For example, if A; represents 
the components of a tensor and we should have x as the components of another tensor, 
then we introduce another tensor, say B;, whose components are the reciprocals of A; 
(ie. B; = a) and use B; instead of ae The purpose of this change of notation is to 
avoid confusion and facilitate the manipulation of tensors according to the familiar rules 


of indices. 


3.2.1 Addition and Subtraction 


Tensors of the same rank and type can be added algebraically to produce a tensor of the 
same rank and type, e.g. 


a=b+e A; = B - C; Ai = Bl +C% (118) 


In this operation the entries of the two tensors are added algebraically componentwise. 
For example, in the second equality of the last equation the C, component of the tensor 
C is subtracted from the B,; component of the tensor B to produce the A; component of 
the tensor A, while in the third equality the B} component of the tensor B is added to 
the C} component of the tensor C to produce the A} component of the tensor A. 

We note that “type” in the above statement refers to variance type (covariant, contravari- 
ant, mixed) and true/pseudo type as well as other qualifications to which the tensors par- 
ticipating in an addition or subtraction operation should match such as having the same 
weight if they are relative tensors, as outlined previously (refer for example to § 3.1.3). 
The added /subtracted terms should also have the same indicial structure with regard to 
their free indices, as explained in § 1.2 in the context of the rules governing the indices 
of tensor expressions and equalities. Hence, the tensors A‘, and Bi, cannot be added or 
subtracted although they are of the same rank and type, but the tensors A‘, and Bi, 
can be added and subtracted. In brief, the tensors involved in addition and subtraction 
operations should satisfy all the rules that have been stated previously on the terms of 
tensor expressions and equalities. 

Addition of tensors is associative and commutative, that is: 


(A+B)+C = A+(B+C) (119) 
A+B = B+A (120) 


In fact, associativity and commutativity can include subtraction if the minus sign is ab- 
sorbed in the subtracted tensor; in which case the operation is converted to addition. 
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3.2.2 Multiplication of Tensor by Scalar 


A tensor can be multiplied by a scalar, which should not be zero in general, to produce a 
tensor of the same variance type, rank and indicial structure, e.g. 


Ai, = aBi, (121) 


where a is a non-zero scalar. As indicated by this equation, multiplying a tensor by a 
scalar means multiplying each component of the tensor by that scalar. Multiplication by 
a scalar is commutative and distributive over algebraic sum of tensors, that is: 


aA = Aa (122) 
a(A+B) = aA+aB (123) 


where a is a scalar and A and B are non-scalar tensors. It is also associative when more 
than two factors are involved, that is: 


a(bA) = (ab) A (124) 
a(AoB) = (aA)oB=Ao (aB) (125) 


where 0 is another scalar and o represents an outer or inner product operation (see § 3.2.3 
and 3.2.5). 


3.2.3 Tensor Multiplication 


This operation, which can be defined generically as multiplication between two non-scalar 
tensors, may also be called outer or exterior or direct or dyadic multiplication, although 
some of these names may be reserved to operations on vectors. On multiplying each 
component of a tensor of rank r by each component of a tensor of rank k, both of dimension 
d, a tensor of rank (r+k) with d"** components is obtained where the variance type of each 
index (covariant or contravariant) is preserved. More clearly, by multiplying a tensor of 
type (m1, 71, w1) by a tensor of type (mga, nz, w2) a tensor of type (m1+ma, n1+N2, Wi +W2) 
is obtained. This means that the tensor rank and weight in the outer product operation 
are additive and the operation conserves the variance type of each index of the tensors 
involved. Also, the order of the indices in the product should match the order of the 
indices in the multiplicands, as will be seen in the following examples, since tensor indices 
refer to specific tensor bases and the multiplication of tensors is not commutative (refer 
tock 3.11. 

For example, if A and B are covariant tensors of rank-1, then on multiplying A by B 
we obtain a covariant tensor C of rank-2 and type (0,2) where the components of C are 
given by: 


while on multiplying B by A we obtain a covariant tensor D of rank-2 and type (0, 2) 
where the components of D are given by: 


Di; = BA; (127) 


3.2.3 Tensor Multiplication 65 


Similarly, if A is a contravariant tensor of rank-2 and B is a covariant tensor of rank-2, 
then on multiplying A by B we obtain a mixed tensor C of rank-4 and type (2,2) where 
the components of C are given by: 


CY, = AY Bu (128) 
while on multiplying B by A we obtain a mixed tensor D of rank-4 and type (2,2) where 
the components of D are given by: 


DP = B;A™ (129) 

In general, the outer product of tensors yields a tensor. In the outer product operation, 
it is generally understood that all the indices of the involved tensors have the same range 
(i.e. all the tensors have the same dimension) although this may not always be the case. As 
indicated before (see § 1.2), there are cases of tensors which are not uniformly dimensioned, 
and in some cases these tensors may be regarded as the result of an outer product of lower 
rank tensors. 

The direct multiplication of tensors may be marked by the symbol ®, mostly when using 
symbolic notation for tensors, e.g. A ® B. However, in the present book no symbol will 
be used to represent the operation of direct multiplication!®! and hence the operation is 
symbolized by putting the symbols of the tensors side by side, e.g. AB where A and B 
are non-scalar tensors. In this regard, the reader should be vigilant to avoid confusion 
with the operation of matrix multiplication which, according to the notation of matrix 
algebra, is also symbolized as AB where A and B are matrices of compatible dimensions, 
since matrix multiplication from tensor perspective is an inner product, rather than an 
outer product, operation. 

The direct multiplication of tensors is not commutative in general as indicated above; 
however it is distributive with respect to algebraic sum of tensors, that is: 


ABZBA (130) 
A(B+C)=AB+AC (BtC)A=BA+LCA (131) 


Regarding the associativity of direct multiplication, there are cases in which this operation 
is not associative according to the research literature of tensor calculus. 

As indicated before, the rank-2 tensor constructed by the direct multiplication of two 
vectors is commonly called dyad. More generally, tensors may be expressed as an outer 
product of vectors where the rank of the resultant product is equal to the number of the 
vectors involved, i.e. 2 for dyads, 3 for triads and so on. However, not every tensor can 
be synthesized as a product of lower rank tensors. Multiplication of a tensor by a scalar 
(refer to § 3.2.2) may be regarded as a special case of direct multiplication since it is a 
tensor multiplication operation with one of the tensors involved being of rank-0 although 
the definition of direct multiplication seems to restrict this operation to non-scalar tensors, 
as stated above. 


[15] We mean “specifically” because we use the symbol o to represent general tensor multiplication which 
includes inner and outer tensor products. 
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3.2.4 Contraction 


The contraction operation of a tensor of rank > 1 is to make two free indices identical, 
by unifying their symbols, followed by performing summation over these repeated indices, 


e.g. 


Aj contraction Ai (132) 
Ai‘ contraction on jl A™ (133) 
nS 


Accordingly, contraction results in a reduction of the rank by 2 since it requires the anni- 
hilation of two free indices by the summation operation. Therefore, the contraction of a 
rank-2 tensor results in a scalar, the contraction of a rank-3 tensor results in a vector, the 
contraction of a rank-4 tensor results in a rank-2 tensor, and so on. Contraction can also 
take place between two tensors as part of an inner product operation, as will be explained 
in § 3.2.5, although the contraction in this case is actually done on the tensor resulting 
from the outer product operation that underlies the inner product. 

For general coordinate systems, the pair of contracted indices should be different in their 
variance type, i.e. one upper and one lower. Hence, contraction of a mixed tensor of type 
(m,n) will in general produce a tensor of type (m — 1,n —1). A tensor of type (p,q) can 
therefore have p x q possible contractions, i.e. one contraction for each combination of 
lower and upper indices. As indicated before, there is no difference between the covariant 
and contravariant types in orthonormal Cartesian systems and hence contraction can take 
place between any pair of indices. Accordingly, a rank-n tensor in orthonormal Cartesian 
systems can have aie) possible individual contraction operations where each one of the n 
indices can be contracted with each one of the remaining (n—1) indices and the factor 2 in 
the denominator represents the fact that the contraction operation is independent of the 
order of the two contracted indices since contracting 2 with 7 is the same as contracting 
j with i.U6 We note that conducting a contraction operation on a tensor results into a 
tensor. Similarly, the application of a contraction operation on a relative tensor (see § 
3.1.3) produces a relative tensor of the same weight as the original tensor. 

A common example of a contraction operation conducted on a single tensor is the op- 
eration of taking the trace of a square matrix, as defined in matrix algebra, by summing 
its diagonal elements, which can be considered as a contraction operation on the rank-2 
tensor represented by the matrix, and hence it yields the trace which is a scalar. Similarly, 
a well known example of a contraction operation that takes place between two tensors is 
the dot product operation on vectors which can be considered as a direct multiplication 
(refer to § 3.2.3) of the two vectors that results in a rank-2 tensor followed by a contraction 
operation and hence it produces a scalar. 


3.2.5 Inner Product 


On taking the outer product (refer to § 3.2.3) of two tensors of rank > 1 followed by a 
contraction (refer to § 3.2.4) on two indices of the product, an inner product of the two 


116] This may also be formulated by the rule of combination. 
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tensors is formed. Hence, if one of the original tensors is of rank-m and the other is of 
rank-n, the inner product will be of rank-(m +n — 2). In the symbolic notation of tensor 
calculus, the inner product operation is usually symbolized by a single dot between the 
two tensors, e.g. A-B, to indicate the contraction operation which follows the outer 
multiplication. 

In general, the inner product is not commutative. When one or both of the tensors 
involved in the inner product are of rank > 1 then the order of the multiplicands does 
matter in general, that is: 


A-B4B-A (134) 


However, the inner product operation is distributive with respect to the algebraic sum of 
tensors, that is: 


A-(BiC)=A-B+A-C (BtC)-A=B-A+C-A (135) 


As indicated before (see § 3.2.4), the dot product of two vectors is an example of the 
inner product of tensors, i.e. it is an inner product of two rank-1 tensors to produce a 
rank-0 tensor. For example, if a is a covariant vector and b is a contravariant vector, then 
their dot product can be depicted as follows: 


[ab],’ = a,b’ contraction a-b=a,b! (136) 


Another common example, from linear algebra, of inner product is the multiplication of 
a matrix representing a rank-2 tensor, by a vector, which is a rank-1 tensor, to produce 
a vector. For example, if A is a rank-2 covariant tensor and b is a contravariant vector, 
then their inner product can be depicted, according to tensor calculus, as follows: 


[Ab], = Ajjb" contraction on jk [A - b], = Aj;b’ (137) 
pee RACES 


This operation is equivalent to the aforementioned operation of multiplying a matrix by a 
vector as defined in linear algebra. We note that we are using here the symbolic notation 
of tensor calculus, rather than the matrix notation, in writing Ab and A -b to represent, 
respectively, the outer and inner products. In matrix notation, Ab is used to represent the 
product of a matrix by a vector which is an inner product according to the terminology of 
tensor calculus. The multiplication of two n x n matrices, as defined in linear algebra, to 
produce another n x n matrix is another example of inner product. In this operation, each 
one of the matrices involved in the multiplication, as well as the product itself, represents 
a rank-2 tensor. 

For tensors whose outer product produces a tensor of rank > 2 and type (m,n) where 
m,n > 0, various contraction operations between different pairs of indices of opposite 
variance type can occur and hence more than one inner product, which are different in 
general, can be defined. Moreover, when the outer product produces a tensor of rank 
> 3 and type (m,n) where m,n > 1, more than one contraction operation can take place 
simultaneously. Based on what we have seen earlier (refer to § 3.2.3 and 3.2.4), the outer 
product of a tensor of type (p,q) by a tensor of type (s,t) produces a tensor of type 
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(p + s,q+t) and hence (p+) x (q+t) individual inner product operations can take 
place, i.e. one inner product for each combination of lower and upper indices. We note 
that for orthonormal Cartesian systems the variance type is irrelevant and hence a rank-n 
tensor can have oo individual inner product operations which is the number of possible 
contraction operations, as explained earlier (see § 3.2.4). 

There are more specialized types of inner product; some of these may be defined differ- 
ently by different authors. For example, a double contraction inner product of two rank-2 
tensors, A and B, may be defined and denoted by double vertically- or horizontally-aligned 
dots (ic. A: B or A--B) to indicate double contraction taking place between different 
pairs of indices. An instance of these specialized types is the inner product with double 
contraction of two dyads which is commonly defined by: 


ab : cd = (a-c) (b- d) (138) 


where the single dots in the right hand side of this equation symbolize the conventional 
dot product of two vectors. The result of this operation is obviously a scalar since it is 
the product of two scalars, as can be seen from the right hand side of the equation. 

Some authors may define a different type of double contraction inner product of two 
dyads, symbolized by two horizontally-aligned dots, which may be called a transposed 
contraction. This type of inner product is given by: 


ab --cd = ab: de = (a-d) (b-c) (139) 


where the result is also a scalar. The second equality of the last equation is based on Eq. 
138. We note that the double inner product operators, i.e. : and --, are defined by some 
authors opposite to the above definitions (i.e. the other way around) and hence one should 
be on the lookout for such differences in convention. 

For two rank-2 tensors, the aforementioned double contraction inner products are simi- 
larly defined as in the case of two dyads, that is: 


Inner products with higher multiplicities of contraction can be defined similarly, and hence 
they may be regarded as trivial extensions of the inner products with lower contraction 
multiplicities. Finally, we note that the inner product of tensors produces a tensor because 
the inner product is an outer product operation followed by a contraction operation and 
both of these operations on tensors produce tensors, as stated before (see 3.2.3 and 3.2.4). 


3.2.6 Permutation 


A tensor may be obtained by exchanging the indices of another tensor. For example, Ai; 
is a permutation of the tensor Aj,. A common example of the permutation operation of 
tensors is the transposition of a matrix representing a rank-2 tensor since the first and 
second indices, which represent the rows and columns of the matrix, are exchanged in this 
operation. It is obvious that tensor permutation applies only to tensors of rank > 1 since 
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no exchange of indices can take place in a scalar with no index or in a vector with a single 
index. Also, permutation may be restricted to indices of the same variance type. The 
collection of tensors obtained by permuting the indices of a given tensor may be called 
isomers. 


3.2.7 Tensor Test and Quotient Rule 


Sometimes a tensor-like object may be suspected for being a tensor; in such cases a test 
based on what is called the “quotient rule” can be used to clarify the situation. We should 
remark that the quotient rule of tensors must not be confused with the quotient rule 
of differentiation. According to the quotient rule of tensors, if the inner product of a 
suspected tensor by a known tensor is a tensor then the suspect is a tensor. In more 
formal terms, if it is not known if A is a tensor or not but it is known that B and C 
are tensors; moreover it is known that the following relation holds true in all rotated (i.e. 
properly-transformed) coordinate frames: 


Price ty ae teas == Oni Fieet (141) 


then A is a tensor. Here, A, B and C are respectively of ranks m, n and (m+n—2), where 
the rank of C is reduced by 2 due to the contraction on k which can be any index of A 
and B independently. We assume, of course, that the rules of contraction of indices, such 
as being of opposite variance type in the case of non-Cartesian coordinates, are satisfied in 
this operation. The form given by the above equation is based, for simplicity, on assuming 
a Cartesian system. 

Finally, we should remark that testing a suspected tensor for being a tensor can also 
be done by employing the first principles through direct application of the transforma- 
tion rules to see if the alleged tensor satisfies the transformation rules of tensors or not. 
However, using the quotient rule is generally more convenient and requires less work. An- 
other remark is that the quotient rule of tensors may be considered by some authors as a 
replacement for the division operation which is not defined for tensors. 


3.3 Tensor Representations 


So far, we are familiar with the covariant and contravariant (including mixed) repre- 
sentations of tensors. There is still another type of representation, that is the physical 
representation which is the common one in the scientific applications of tensor calculus 
such as fluid and continuum mechanics. The introduction and employment of the physical 
representation of tensors are justified by the fact that the covariant and contravariant 
basis vectors, as well as the covariant and contravariant components of a vector, do not 
in general have the same physical dimensions as explained earlier (see § 2.2). Moreover, 
the basis vectors may not have the same magnitude. This motivates the introduction 
of a more standard form of vectors by using physical components (which have the same 
dimensions) with normalized basis vectors (which are dimensionless with unit magnitude) 
where the metric tensor and the scale factors are employed to facilitate this process. The 
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normalization of the basis vectors is done by dividing each vector by its magnitude. For 
example, the normalized covariant and contravariant basis vectors of a general coordinate 
system, E; and E’, are given by:!!7l 


hk’ = — (no sum on 7) (142) 


‘(Ei |E"| 


which for an orthogonal coordinate system becomes (see Eq. 70): 


BE; = =— BE’ = —= =hE (no sum oni) (143) 


Gi NG J/g 
where gi and g” are the i‘” diagonal elements of the covariant and contravariant metric 
tensor respectively and h, is the scale factor of the i” coordinate as described previously 
(see § 2.5 and 2.6). 
Consequently, if the physical components of a contravariant vector are notated with a 
hat, then for an orthogonal system we have: 


: ee ~. BK; oe : ‘ 
A = AE, = A’'E, = A’— = A= /gGjA = h,A’ (no sum) (144) 
V Gu 


Similarly, for the covariant form of the vector we have: 


ene. , -F& . A; A. 
A=A,F’ = A,E’ = A; - => A; = 4/g# A; = —— = no sum) (145 
~ ViA = =F (no sum) (145) 


These definitions and processes can be easily extended to tensors of higher ranks as we 
will see next. 

The physical components of higher rank tensors are similarly defined as for rank-1 ten- 
sors by considering the basis vectors of the coordinate system of the space where similar 
simplifications apply to orthogonal coordinate systems. For example, for a rank-2 tensor 
A in an orthogonal coordinate system, the physical components can be represented by: 


; Ae 


A = rh (no sum on i or j, with basis E’E/) (146) 
weg 
AG = hh AY (no sum on i or j, with basis E,E,) (147) 
»,  h,A’, St ie 2 
A= he 4 (no sum on i or j, with basis E;E’) (148) 


On generalizing the above pattern, we conclude that the physical components of a tensor 
of type (m,n) in an orthogonal coordinate system are given by: 
Digquacde tt 


am Ati am 


Aga an oa ce (no sum on any index) (149) 


nm 


[171 We note that the factors in the denominators are scalars and hence the rules of indices are not violated. 
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where the symbols and basis tensors are defined similarly following the above examples. 

As a consequence of the last statements, in a space with a well defined metric any 
tensor can be expressed in covariant or contravariant (including mixed) or physical forms 
using different sets of basis tensors. Moreover, these forms can be transformed from 
each other by using the raising and lowering operators and scale factors. As before, for 
orthonormal Cartesian systems the covariant, contravariant and physical components are 
the same where the Kronecker delta is the metric tensor. This is because the covariant, 
contravariant and physical basis vectors are identical in these systems. 

More generally, for orthogonal coordinate systems the two sets of normalized covariant 
and contravariant basis vectors are identical as established earlier because the correspond- 
ing vectors of each basis set are in the same direction (see § 2.4 and 2.6), and hence the 
physical components corresponding to the covariant and contravariant components are 
identical as well. Consequently, for orthogonal coordinate systems with orthonormal basis 
vectors, the covariant, contravariant and physical components are identical because the 
normalized bases corresponding to these three forms are identical. 

The physical components of a tensor may be represented by the symbol of the tensor with 
subscripts denoting the coordinates of the employed coordinate system. For instance, if A 
is a vector in a 3D space with contravariant components A’ or covariant components Aj, 
its physical components in Cartesian, cylindrical, spherical and general curvilinear systems 
may be denoted by (Az, Ay, Az), (Ap, Ag, Az), (Ar, Ag, Ag) and (Ay, Ay, Aw) respectively. 
For consistency and dimensional homogeneity, the tensors in scientific applications are 
commonly represented by their physical components with a set of normalized basis vectors. 
The invariance of the tensor form then guarantees that the same tensor formulation is valid 
regardless of any particular coordinate system where standard tensor transformations can 
be used to convert from one form to another without affecting the validity and invariance 
of the formulation. 


3.4 Exercises and Revision 


3.1 Define “covariant” and “contravariant” tensors from the perspective of their notation 
and their transformation rules. 

3.2 Write the transformation relations for covariant and contravariant vectors and for co- 
variant, contravariant and mixed rank-2 tensors between different coordinate systems. 

3.3 State the practical rules for writing the transformation relations of tensors between 
different coordinate systems. 

3.4 What are the raising and lowering operators and how they provide the link between 
the covariant and contravariant types? 

3.5 A is a tensor of type (m,n) and B is a tensor of type (p,g,w). What this means? 
Write these tensors in their indicial form. 

3.6 Write the following equations in full tensor notation and explain their significance: 


E-2 


Ou! - 
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out 


3.8 


3.9 


3.10 


3.11 


3.12 


3.13 


3.14 


3.15 


3.16 


ot 


3.18 


3.19 


3.20 
3.21 


3.22 
3.23 
3.24 
3.25 
3.26 
3.27 


3.28 


3.29 


Write the orthonormalized form of the basis vectors in a 2D general coordinate system. 
Verify that these vectors are actually orthonormal. 
Why the following relations are labeled as the reciprocity relations? 

E; -E’ = 67 BE’. Ej = 0'; 
The components of the tensors A, B and C are given by: A:,/, BI yg and Cy! ;. 
Write these tensors in their full notation that includes their basis tensors. 
A, B and C are tensors of rank-2, rank-3 and rank-4 respectively in a given coordinate 
system. Write the components of these tensors with respect to the following basis 
tensors: , E’E”, E;E,E,,, and E,E’E,E”. 
What “dyad” means? Write all the nine unit dyads associated with the double direc- 
tions of rank-2 tensors in a 3D space with a rectangular Cartesian coordinate system 
(i.e. eye, °°: €3€3). 
Make a simple sketch of the nine dyads of exercise 3.11. 
Compare true and pseudo vectors making a clear distinction between the two with a 
simple illustrating plot. Generalize this to tensors of any rank. 
Justify the following statement: “The terms of consistent tensor expressions and equa- 
tions should be uniform in their true and pseudo type”. 
What is the curl of a pseudo vector from the perspective of true/pseudo qualification? 
Define absolute and relative tensors stating any necessary mathematical relations. 
What is the weight of the product of A and B where A is a tensor of type (1, 2, 2) 
and B is a tensor of type (0,3, —1)? 
Show that the determinant of a rank-2 absolute tensor A is a relative scalar and find 
the weight in the case of A being covariant and in the case of A being contravariant. 
Why the tensor terms of tensor expressions and equalities should have the same 
weight? 
What “isotropic” and “anisotropic” tensor mean? 
Give an example of an isotropic rank-2 tensor and another example of an anisotropic 
rank-3 tensor. 
What is the significance of the fact that the zero tensor of all ranks and all dimensions 
is isotropic with regard to the invariance of tensors under coordinate transformations? 
Define “symmetric” and “anti-symmetric” tensor. Why scalars and vectors are not 
qualified to be symmetric or anti-symmetric? 
Write the symmetric and anti-symmetric parts of the tensor A’. 
Write the symmetrization and anti-symmetrization formulae for a rank-n tensor A’, 
Symmetrize and anti- symmetrize the tensor A‘/*' with respect to its second and fourth 
indices. 
Write the two mathematical conditions for a rank-n tensor A’’" to be totally sym- 
metric and totally anti-symmetric. 
The tensor A;,,;, is totally symmetric. How many distinct components it has in a 3D 
space? 
The tensor B;,;, is totally anti-symmetric. How many identically vanishing components 
it has in a 3D space? How many distinct non-identically vanishing components it has 


3.4 Exercises and Revision 73 


3.30 


3.31 


3.32 


3.33 


3.34 


3.35 


3.36 


3.37 


3.38 


3.39 


3.40 


3.41 


3.42 


3.43 


3.44 


3.45 


3.46 


3.47 


in a 3D space? 

Give numeric or symbolic examples of a rank-2 symmetric tensor and a rank-2 skew- 
symmetric tensor in a 4D space. Count the number of independent non-zero compo- 
nents in each case. 

Write the formula for the number of independent components of a rank-2 symmetric 
tensor, and the formula for the number of independent non-zero components of a 
rank-2 anti-symmetric tensor in nD space. 

Explain why the entries corresponding to identical anti-symmetric indices should van- 
ish identically. 

Why the indices whose exchange defines the symmetry and anti-symmetry relations 
should be of the same variance type? 

Discuss the significance of the fact that the symmetry and anti-symmetry character- 
istic of a tensor is invariant under coordinate transformations and link this to the 
invariance of the zero tensor. 

Verify the relation A;;B’ = 0, where Aj; is a symmetric tensor and BY is an anti- 
symmetric tensor, by writing the sum in full assuming a 3D space. 

Classify the common tensor operations with respect to the number of tensors involved 
in these operations. 

Which of the following operations are commutative, associative or distributive when 
these properties apply: algebraic addition, algebraic subtraction, multiplication by a 
scalar, outer multiplication, and inner multiplication? 

For question 3.37, write all the required mathematical relations that describe those 
properties. 

The tensors involved in tensor addition, subtraction or equality should be compatible 
in their types. Give all the details about these “types”. 

What is the meaning of multiplying a tensor by a scalar in terms of the components 
of the tensor? 

A tensor of type (m 1,71, W 1) is multiplied by another tensor of type (m2, no, we). 
What is the type, the rank and the weight of the product? 

We have two tensors: A = A, E'E! and B = B*'E,,E,;. We also have C = AB and 
D =BA. Use the properties of tensor operations to obtain the full expression of C 
and D in terms of their components and basis tensors (i.e. C = AB =---etc.). 
Explain why tensor multiplication, unlike ordinary multiplication of scalars, is not 
commutative considering the basis tensors to which the tensors are referred. 

The direct product of vectors a and b is ab. Edit the following equation by adding a 
simple notation to make it correct without changing the order: ab = ba. 

What is the difference in notation between matrix multiplication and tensor multipli- 
cation of two tensors, A and B, when we write AB? 

Define the contraction operation of tensors. Why this operation cannot be conducted 
on scalars and vectors? 

In reference to general coordinate systems, a single contraction operation is conducted 
on a tensor of type (m,n,w) where m,n > 0. What is the rank, the type and the 
weight of the contracted tensor? 
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3.48 


3.49 
3.50 
3.01 
3.02 
3.53 
3.54 
3.59 
3.56 


3.07 
3.98 


3.59 


3.60 
3.61 


3.62 


3.63 


3.64 


3.65 


3.66 


3.67 


3.68 


3.69 


3.70 


3.71 


What is the condition that should be satisfied by the two tensor indices involved in a 
contraction operation assuming a general coordinate system? What about tensors in 
orthonormal Cartesian systems? 

How many individual contraction operations can take place in a tensor of type (m,n, w) 
in a general coordinate system? Explain why. 

How many individual contraction operations can take place in a rank-n tensor in an 
orthonormal Cartesian coordinate system? Explain why. 

List all the possible single contraction operations that can take place in the tensor 


List all the possible double contraction operations that can take place in the tensor 
Ae oes 

Give examples of contraction operation from matrix algebra. 

Show that contracting a rank-n tensor results in a rank-(n — 2) tensor. 

Discuss inner multiplication of tensors as an operation composed of two more simple 
operations. 

Give common examples of inner product operation from linear algebra and vector 
calculus. 

Why inner product operation is not commutative in general? 

Complete the following equations where A and B are rank-2 tensors of opposite 
variance type: 


A: B=? A--B=? 


Write ab: cd in component form assuming a Cartesian system. Repeat this with 
ab - - cd. 

Why the operation of inner multiplication of tensors results in a tensor? 

We have: A = A'E;, B = B jE! and C = C™E,,E”. Find the following tensor 
products: AB, AC and BC. 

Referring to the tensors in question 3.61, find the following dot products: B-B, C-A 
and C -B. 

Define permutation of tensors giving an example of this operation from matrix algebra. 
State the quotient rule of tensors in words and in a formal mathematical form. 

Why the quotient rule is usually used in tensor tests instead of applying the transfor- 
mation rules? 

Outline the similarities and differences between the three main forms of tensor repre- 
sentation, 1.e. covariant, contravariant and physical. 

Define, mathematically, the physical basis vectors, FE; and E’, in terms of the covariant 
and contravariant basis vectors, E; and E’. 

Correct, if necessary, the following relation: Aikn = ke 
where A is a tensor in an orthogonal coordinate system. 
Why the normalized covariant, contravariant and physical basis vectors are identical 
in orthogonal coordinate systems? 

What is the physical significance of being able to transform one type of tensors to 
other types as well as transforming between different coordinate systems? 

Why the physical representation of tensors is usually preferred in the scientific appli- 


A‘*” (no sum on any index) 
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cations of tensor calculus? 

3.72 Give a few common examples of physical representation of tensors in mathematical 
and scientific applications. 

3.73 What is the advantage of representing the physical components of a tensor (e.g. A) 
by the symbol of the tensor with subscripts denoting the coordinates of the employed 
coordinate system, e.g. (A,, Ag, Ag) in spherical coordinate systems? 


Chapter 4 
Special Tensors 


The subject of investigation of this chapter is those tensors that form an essential part of 
the tensor calculus theory itself, namely the Kronecker, the permutation and the metric 
tensors. The chapter also includes a section devoted to the generalized Kronecker delta 
tensor which may be regarded as a bridge between the ordinary Kronecker delta tensor 
and the permutation tensor. There is also another section in which a number of important 
tensor identities related to the Kronecker or/and permutation tensors are collected and 
discussed. Finally, there is a section devoted to some important mathematical definitions 
and applications in which special tensors are employed. 

The Kronecker and permutation tensors are of particular importance in tensor calculus 
due to their distinctive properties and unique transformation attributes. They are nu- 
merical tensors with invariant components in all coordinate systems. They enter in the 
definition of many mathematical objects in tensor calculus and are used to facilitate the 
formulation of many tensor identities. Similarly, the metric tensor is one of the most 
important tensors (and may even be the most important) in tensor calculus and its ap- 
plications, as will be revealed in § 4.5 and other parts of this book. In fact, it permeates 
the whole subject of tensor calculus due to its role in characterizing the space and its 
involvement in essential mathematical definitions, operations and transformations. For 
example, it enters in the definition of many mathematical concepts, such as curve length 
and surface area, and facilitates the transformation between covariant and contravariant 
types of tensors and their basis vectors. 


4.1 Kronecker delta Tensor 


As indicated above, there are two types of Kronecker delta tensor: ordinary and general- 
ized. In the present section we discuss the ordinary Kronecker delta tensor and in § 4.4 
we investigate the generalized Kronecker delta tensor. 

The ordinary Kronecker delta tensor, which may also be called the unit tensor, is a 
rank-2 numeric, absolute, symmetric, constant, isotropic tensor in all dimensions. It is 
defined in its covariant form as: 


1 e=7) £3 

Oi; = i: (i SD oe) (150) 
: f (i #j) 

where n is the space dimension, and hence it can be considered as the identity tensor or 

matrix. For example, in a 3D space the Kronecker delta tensor is given by: 


O11 O12 013 ii 0 0 
[5:5] — 691 699 693 = 0 1 0 (151) 
031 032 033 0 0 1 
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The contravariant and mixed forms of the ordinary Kronecker delta tensor, i.e. 6 and 
6), are similarly defined. Consequently, the numerical values of the components of the 
covariant, contravariant and mixed types of the Kronecker delta tensor are the same, that 
is: 


bij = 07 = 5 = 6) (152) 


and hence they are all defined by Eq. 150. We note that in the last equation (and in any 
similar equation) the numerical values of the components (not the tensors) are equated 
and hence this is not a violation of the rules of tensor indices as stated in § 1.2. The tensor 
is made of components with reference to a set of basis vectors and hence the equality of 
the components of two tensors of different variance type does not imply the equality of 
the two tensors since the two basis vector sets to which the two tensors are referred can 
be different. 
From the above definitions, it can be seen that the Kronecker delta tensor is symmetric, 
that is: 
big = Oj 64 = 6" (153) 


where 2,7 = 1,2,...,n. Moreover, the tensor is conserved under all proper and improper 
coordinate transformations where “conserved” means that the tensor keeps the numerical 
values of its components following a coordinate transformation. Since it is conserved 
under proper transformations, it is an isotropic tensor. We note that being conserved 
under all transformations is stronger than being isotropic as the former applies even under 
improper coordinate transformations while isotropy is restricted, by definition, to proper 
transformations (see § 3.1.4). We also used “conserved” rather than “invariant” to indicate 
the preservation of the components and to avoid confusion with form-invariance which is 
a property that characterizes all tensors. 


4.2 Permutation Tensor 


The permutation tensor, which is a numeric tensor with constant components, is also 
known as the Levi-Civita, anti-symmetric and alternating tensor. We note that the “Levi- 
Civita” label is usually used for the rank-3 permutation tensor. Also some authors distin- 
guish between the permutation tensor and the Levi-Civita tensor even for rank-3. More- 
over, some of the common labels and descriptions of the permutation tensor are more 
specific to rank-3. Hence, differences in conventions, definitions and labels should be con- 
sidered when reading the literature of tensor calculus related to the permutation tensor. 
The permutation tensor has a rank equal to the number of dimensions of the space. 
Hence, a rank-n permutation tensor has n” components. This tensor is characterized by 
the following properties: 
1. It is numeric tensor and hence the value of its components are: —1, 1 and 0 in all 
coordinate systems. 
2. The value of any particular component (e.g. €312) of this tensor is the same in any 
coordinate system and hence it is constant tensor in this sense. 
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3. It is relative tensor of weight —1 for its covariant form and +1 for its contravariant 
form. 

4. It is isotropic tensor (see § 3.1.4) since its components are conserved under proper 
transformations. 

5. It is totally anti-symmetric in each pair of its indices, i.e. it changes sign on swapping 
any two of its indices. 

6. It is pseudo tensor (see § 3.1.2) since it acquires a minus sign under improper or- 
thogonal transformation of coordinates. 

7. The permutation tensor of any rank has only one independent non-vanishing com- 
ponent because all the non-zero components of this tensor are of unity magnitude. 

8. The rank-n permutation tensor possesses n! non-zero components which is the num- 
ber of the non-repetitive permutations of its indices. 

9. The rank and the dimension of the permutation tensor are identical and hence in an 
nD space it has n” components. 

The rank-2 permutation tensor €;; in its covariant form is defined by: [28] 


€j2 = 1 €21, = —1 €j1 = €22 = 0 (154) 


Similarly, the rank-3 permutation tensor €;;, in its covariant form is defined by: 


1 (i,j,k is even permutation of 1,2,3) 
€ijk = ( —1 (i,j,k is odd permutation of 1,2,3) (155) 
0 (repeated index) 


The contravariant form of the rank-2 and rank-3 permutation tensors is similarly defined. 

Figure 15 is a graphical illustration of the rank-3 permutation tensor €;;, while Figure 
16, which may be used as a mnemonic device, demonstrates the cyclic nature of the three 
even permutations of the indices of the rank-3 permutation tensor and the three odd 
permutations of these indices assuming no repetition of indices. The three permutations 
in each case are obtained by starting from a given number in the cycle and rotating in the 
given sense to obtain the other two numbers in the permutation. 

The definition of the rank-n permutation tensor (i.e. €j,45..;,, and ¢’!2’") is similar to 
the definition of the rank-3 permutation tensor with regard to the repetition in its indices 


(71, 72,+++ in) and being even or odd permutations in their correspondence to (1, 2,--- ,7), 
that is: 
1 (71, %2,..-,%, is even permutation of 1,2,...,7) 
Gp = Cer Sx =I (i1,72,..-,2, is odd permutation of 1,2,...,n) (156) 


0 (repeated index) 


As stated before, equations like this defines the numeric values of the tensor components 
and hence they do not violate the rules of indices with regard to their variance type (see 
§ 1.2) or the rules of relative tensors with regard to their weight (see § 3.1.3). 


118] There is no rank-1 permutation tensor as there is no possibility of permutation in a 1D space. 
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€332 


€111 €112 C113 


Figure 15: Graphical illustration of the rank-3 permutation tensor €;;, where circular nodes 
represent 0, square nodes represent 1 and triangular nodes represent —1. 


As well as the inductive definition of the permutation tensor (as given by Eqs. 154, 155 
and 156), the permutation tensor of any rank can also be defined analytically where the 
entries of the tensor are calculated from closed form formulae. Accordingly, the values of 
the entries of the rank-2 permutation tensor can be calculated from the following closed 
form equation: 

ey = €7 = (j -4) (157) 
where each one of the indices 7,7 ranges over 1,2. Similarly, the numerical values of the 
entries of the rank-3 permutation tensor are given by: 


ce = = FD R-(-S) (158) 


where each one of the indices 7,7,k ranges over 1,2,3. As for the rank-4 permutation 
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Even Odd 


Figure 16: Graphical demonstration of the cyclic nature of the even and odd permutations 
of the indices of the rank-3 permutation tensor assuming no repetition in indices. 


tensor we have: 
equa" = 5 (i — 7) (k—-t) G—-4) (K-97) U-9) UC -&) (159) 


where each one of the indices 7, 7, k,/ ranges over 1, 2,3, 4. 
More generally, the numerical values of the entries of the rank-n permutation tensor can 
be obtained from the following formula: 


Easagan = euaa an — vl F I] (a; 2: 7) = Swan I] (a; — a;) (160) 


i=1 LL j=i+l 1<i<j<n 
where each one of the indices a1,--- ,@, ranges over 1,--- ,n and S(n — 1) is the super 


factorial function of the argument (n — 1) which is defined by: 


S(k) =[]#=1!-21-...-4! (161) 


i=1 


A simpler formula for calculating the numerical values of the entries of the rank-n permu- 
tation tensor can be obtained from the previous one (Eq. 160) by dropping the magnitude 
of the multiplication factors and taking their signs only, that is: 


Eayag~am = EN" = [sgn (ay — a) = sen ( Il @- «)) (162) 
1<i<j<n 1l<i<j<n 


where sgn(k) is the sign function of the argument k which is defined by: 


+1 (k >0 
sen(k) = < —1 (k <0) (163) 
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The sign function in Eq. 162 can be expressed in a more direct form for the non-vanishing 
entries of the permutation tensor (which correspond to the non-repetitive permutations) 
by dividing each argument of the multiplicative factors in Eq. 162 by its absolute value, 
noting that none of these factors is zero, and hence Eq. 162 becomes: 


Eaya2--an — Gre I] ( 4 i (a; a aj) (164) 
1<i<j<n Ja; — a] 


Regarding the vanishing entries, they are easily identified by having repeated indices and 
hence all the entries of the tensor are easily evaluated. 

As stated above, the permutation tensor is totally anti-symmetric (see § 3.1.5) in each 
pair of its indices, i.e. it changes sign on swapping any two of its indices, that is: 


Cit igecipecin = 41 .ip..tg ein is = —-€ 165 


The reason is that any exchange of two indices requires an even/odd number of single- 
step shifts to the right of the first index plus an odd/even number of single-step shifts to 
the left of the second index, so the total number of shifts is odd and hence it is an odd 
permutation of the original arrangement. This may also be concluded from the closed 
form formulae (e.g. Eq. 162) where any exchange will lead to an exchange of the indices 
of a single multiplicative factor which leads to sign change.!9! Also, the permutation 
tensor is a pseudo tensor since it acquires a minus sign under an improper orthogonal 
transformation of coordinates, i.e. inversion of axes that changes the system handedness 
(see § 2.3.1). However, it is an isotropic tensor since it is conserved under proper coordinate 
transformations. 

The permutation tensor may be considered as a contravariant relative tensor of weight 
+1 or as a covariant relative tensor of weight —1. Since the contravariant and covariant 
types of the permutation tensor are relative tensors, it is desirable to define absolute 
covariant and contravariant forms of the permutation tensor. This is done by using the 
metric tensor, that is: 


Ein cin = VG Fit.nvin citvin — _T_ gltin (166) 


where the indexed € and € are respectively the relative and absolute permutation tensors 
of the given type, and g is the determinant of the covariant metric tensor g;;. Referring 
to Eqs. 63 and 93, we see that the € tensors are absolute with w = 0. We note that 
the contravariant form of the absolute permutation tensor requires a sign function but the 
details are out of the scope of the present text (see Zwillinger in the References). However, 
for the rank-3 permutation tensor, which is the one used mostly in the present book, the 
above expression stands as it is. 


[19] The non-existence of a mixed type permutation tensor is also related to its totally anti-symmetric 
nature. 
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The contravariant and covariant types of the rank-3 permutation tensor are linked, 
through the Kronecker delta tensor, by the following relations: 


EFF emp = 0162, — 56) (167) 
E OreOn 

6 etn =O 01, OF (168) 
oT Om On 


where the indexed 6 represent the mixed form of the ordinary Kronecker delta. The last 
equation can be generalized to nD spaces as follows: 


52 92... 62 

dytget -_ ft J J2 In 
. "Ejijanjn = | 6 ee. é (169) 

dn ‘ti dn 

On OF oF 


The contravariant and covariant types of the rank-n permutation tensor are also linked by 
the following relation: 


¢ittantn Cijtig--in = n! (170) 


where n is the space dimension. The left hand side of this equation represents the sum of 
the products of the corresponding entries of the contravariant and covariant types of the 
permutation tensor. 

On comparing Eq. 169 and the upcoming Eq. 199 we obtain the following identity which 
links the contravariant and covariant types of the permutation tensor to each other and 
to the generalized Kronecker delta: 
ete €51...5n = jitn (171) 


JieIn 


More details about these identities will be given in the subsequent sections of this chapter. 


4.3 Identities Involving Kronecker or/and Permutation Tensors 
4.3.1 Identities Involving Kronecker delta Tensor 


When an index of the Kronecker delta tensor is involved in a contraction operation by 
repeating an index in another tensor in its own term, the effect of this is to replace the 
shared index in the other tensor by the other index of the Kronecker delta, that is: 


6A; = Aj 5,Aj = A’ (172) 


In such cases the Kronecker delta is described as an index replacement or substitution 
operator. Hence, we have: 


Ora OF (173) 
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Similarly, we have: 

51 0 Sj S000 San (174) 
where n is the space dimension. The last part of this equation (i.e. 6) = n) can be easily 
justified by the fact that 6! is the trace of the identity tensor considering the summation 
convention. 

Due to the fact that the coordinates are independent of each other (see § 2.2), we also 
have the following identity which is based on the well known rules of partial differentiation: 


Ous 


Hence, in an nD space we obtain the following identity from the identities of Eqs. 175 and 
174: 


eas The oy (175) 


diu' = ds =n (176) 


Based on the above identities and facts, the following identity can be shown to apply in 
orthonormal Cartesian coordinate systems: 


Ox; Ox; 
j 


This identity is based on the two facts that the coordinates are independent, and the 
covariant and contravariant types are the same in orthonormal Cartesian coordinate sys- 
tems. 

Similarly, for a set of orthonormal vectors, such as the basis set of orthonormal Cartesian 
system, the following identity can be easily proved: 


e;-e; = On (178) 


where the indexed e represents the orthonormal vectors. This identity is no more than 
a symbolic statement of the fact that the vectors in such a set are mutually orthogonal 
and each one is of unit length. We note that for orthonormal basis sets the covariant and 
contravariant forms are identical as explained earlier. 

Finally, the double inner product of two dyads formed by an orthonormal set of vectors 
satisfies the following identity (see § 3.2.5): 


e;e;: ene; = (e; - ex) (€; - €1) = bin d51 (179) 


This identity is a combination of Eq. 138 and Eq. 178. 


4.3.2 Identities Involving Permutation Tensor 


From the definition of the rank-3 permutation tensor, we obtain the following identity 
which demonstrates the sense of cyclic order of the non-repetitive permutations of this 
tensor: 


€ijk = Ekig = Ejki = —€iky = —Ejik = —Ekji (180) 
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This identity is also a demonstration of the fact that the rank-3 permutation tensor is 
totally anti-symmetric in all of its indices since a shift of any two indices reverses its 
sign. Moreover, it reflects the fact that this tensor has only one independent non-zero 
component since any one of the non-zero entries, all of which are given by Eq. 180, can be 
obtained from any other one of these entries by at most a reversal of sign. This identity 
also applies to the contravariant form of the permutation tensor. 

We also have the following identity for the rank-n permutation tensor: 


citanin gg = (181) 


This identity is based on the fact that the left hand side is the sum of the squares of the 
epsilon symbol over all the n! non-repetitive permutations of n different indices where the 
value of epsilon of each one of these permutations is either +1 or —1 and hence in both 
cases their square is +1. The repetitive permutations are zero and hence they do not 
contribute to the sum. 

The double inner product of the rank-3 permutation tensor and a symmetric tensor A 
is given by the following identity: 


€ijn AY = €ijn A” = €:jn AP” =0 (182) 


This is because an exchange of the two indices of A does not affect its value due to the 
symmetry of A whereas a similar exchange of these indices in €;;, results in a sign change; 
hence each term in the sum has its own negative and therefore the total sum is identically 
zero. This identity also applies to the contravariant permutation tensor with a covariant 
symmetric tensor. 

Another identity with a trivial outcome that involves the rank-3 permutation tensor and 
a vector A is the following: 


€;jnA' A! => €ijnA' AX => €:jnA? A* = 0 (183) 


This can be explained similarly by the fact that, due to the commutativity of ordinary 
multiplication, an exchange of the indices in A’s will not affect the value but a similar 
exchange in the corresponding indices of €;;, will cause a change in sign; hence each term 
in the sum has its own negative and therefore the total sum will be zero. This identity 
also applies to the contravariant permutation tensor with a covariant vector. We also note 
that the identities of Eqs. 182 and 183 similarly apply to other ranks of the permutation 
tensor as they are based on the totally anti-symmetric property of this tensor. 

Finally, for a set of three orthonormal vectors forming a right handed system, the fol- 
lowing identities are satisfied: 


ei xe; = Cjjkek (184) 
ej (€; X ex) = €igk (185) 
These identities are based, respectively, on the forthcoming definitions of the cross product 


(see Eq. 494) and the scalar triple product (see Eq. 495) in tensor notation plus the fact 
that these vectors are unit vectors. 
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Table 2: Truth table for the identity of Eq. 186. 


i j k; l ad Ek aur i Of [ On O40) — 919% 
1 1 1 1 0 0 0 1 1 1 1 0 
1 1 1 2 0 1 0 1 0 0 1 0 
1 1 2 1 0 -1 0 0 1 1 0 0 
1 1 2 2 0 0 0 0 0 0 0 0 
1 2 1 1 1 0 0 1 0 1 0 0 
1 2 1 2 1 1 1 1 1 0 0 1 
1 2 2 1 1 -1 -1 0 0 1 1 -1 
1 2 2 2 1 0 0 0 1 0 1 0 
2 1 1 1 -1 0 0 0 1 0 1 0 
2 1 1 2 -1 1 -1 0 0 1 1 -1 
2 1 2 1 -1 -1 1 1 1 0 0 1 
2 1 2 2 -1 0 0 1 0 1 0 0 
2 2 1 1 0 0 0 0 0 0 0 0 
2 2 1 2 0 0 0 1 1 0 0 
2 2 2 1 0 - 0 1 0 0 1 0 
2 2 2 2 0 0 1 1 1 1 0 
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For the rank-2 permutation tensor, we have the following identity which involves the 
ordinary Kronecker delta tensor in 2D: 


Jeg = | ; 7 | — 5,01 ma 5104 (186) 
Pes 

This identity can be proved inductively by building a table for the values on the left 
and right hand sides as the indices are varied, as seen in Table 2. The pattern of the 
indices in the determinant array of this identity is simple, that is the indices of the first 
€ provide the indices for the rows as the upper indices of the deltas while the indices of 
the second € provide the indices for the columns as the lower indices of the deltas. In 
fact, the role of these indices in indexing the rows and columns can be shifted. This can 
be explained by the fact that the positions of the two epsilons can be exchanged, since 
ordinary multiplication is commutative, and hence the role of the epsilons in providing 
the indices for the rows and columns will be shifted. This can also be done by taking 
the transposition of the array of the determinant, which does not change the value of the 
determinant since det (A) = det (A”). 

We note that a table like Table 2 is similar to the truth tables used in verifying Boolean 
and logic identities, and for this reason we label it as a “truth table”. We also note that the 
method of building a “truth table” like Table 2 can also be used for proving other similar 
identities. The main advantage of this method is that it is easy and straightforward while 
its main disadvantage is that it is lengthy and hence it may not be feasible for some messy 
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identities. Moreover, it can provide proofs for special cases but it is not general with 
respect to proving similar identities in a general nD space where n is variable although it 
may be used as a part of an inductive proof. Another advantage of this method is that it 
lends itself to programming and hence it is ideal for use in computing. We note that the 
subject matter in these tables is the tensor components. 

Another useful identity involving the rank-2 permutation tensor with the Kronecker delta 
tensor in 2D is the following: 
es = Op (187) 


é 


This can be obtained from the identity of Eq. 186 by replacing j with / followed by 
minimal algebraic manipulations using tensor calculus rules, that is: 


een, = 64.6) — O16}, (Eq. 186 with 7 > 1) (188) 
= 261 — dd) (Eq. 174) 
= 26: — 6, (Eq. 173) 
= 6} 


Similarly, we have the following identity which correlates the rank-3 permutation tensor 
to the Kronecker delta tensor in 3D: 


a § 6! 
eK emn =| 51, 62 |= (5162,6% + 5% 6267 + 54,6768) — (189) 
ot Om On 


(15,5 + 551 5n, + 95551 ) 


Again, the indices in the determinant of this identity follow the same pattern as that of 
Eq. 186. Another pattern can also be seen in the six terms on the right where the three 
upper indices of all terms are 77k while the three lower indices in the positive terms are 
the even permutations of /mn and the three lower indices in the negative terms are the 
odd permutations of lmn. This identity can also be established by a truth table similar 
to Table 2. 

More generally, the determinantal form of Eqs. 186 and 189, which link the rank-2 and 
rank-3 permutation tensors to the ordinary Kronecker delta tensors in 2D and 3D spaces, 
can be extended to link the rank-n permutation tensor to the ordinary Kronecker delta 
tensor in an nD space, that is: 


62 §2 ... §2 

dytget — 2 = Jil, J2 In 
€ ” Cj join = . : es: (190) 

in in in 

re Or rk 


Again, the pattern of the indices in the determinant of this identity in their relation to the 
indices of the two epsilons follow the same rules as those of Eqs. 186 and 189. Moreover, 
the proofs of Eqs. 186 and 189 may be extended to Eq. 190 by induction. 
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Another useful identity in this category is the following: 


. bi 6h fis — 
Meant = i gr | = 5168, — nlf (191) 


This identity can be obtained from the identity of Eq. 189 by replacing n with k, that is: 


CF emp = 6462,06 + 6L, 526" + 5LaTS* — 515258 — 5° 66k — 54.62, 5F (192) 
30162, + 61,6) + 6,6) — 6762, — 361,57 — 5762, 

5j6), — 51,6) 

_ | J, 5m 
| a 6d, 


The pattern of the indices in this identity (Eq. 191) is as before if we exclude the repetitive 
index k. 

The identity of Eq. 191, which may be called the epsilon-delta identity or the contracted 
epsilon identity or the Levi-Civita identity, is very useful in manipulating and simplifying 
tensor expressions and proving vector and tensor identities; examples of which will be seen 
in § 7.1.5. We note that the determinantal form, seen in the middle equality of Eq. 191, 
can be considered as a mnemonic device for this identity where the first and second indices 
of the first € index the rows while the first and second indices of the second € index the 
columns, as given above. In fact, the determinantal form in all the above equations which 
are given in this form is a mnemonic device for all these equations, and not only Eq. 191, 
where the expanded form, if needed, can be easily obtained from the determinant which 
can be easily built following the simple pattern of indices, as explained above. 

Other common identities in this category are: 


cette, = 26 (193) 
ey, = 2 =2x3=3!=6 (194) 


The first of these identities can be obtained from Eq. 191 with the replacement of m with 
j followed by some basic tensor manipulation, that is: 


cers, = 6,0, — O46 (Eq. 191 with m > 7) (195) 
= 36; — 6 (Eqs. 174 and 173) 
20! 


while the second of these identities can be obtained from the first by replacing / with i 
and applying the summation convention in 3D on the right hand side, i.e. using Eq. 174. 
The second identity is, in fact, an instance of Eq. 181 for a 3D space. 

Another common identity that involves the rank-3 permutation tensor with the ordinary 
Kronecker delta in a 3D space is the following: 


€1jk010308 = €123 = 1 (196) 
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This identity is based in its first part on the use of the ordinary Kronecker delta as an 
index replacement operator (Eq. 172), where each one of the deltas replaces an index in 
the permutation tensor, and is based in its second part on the definition of the permutation 
tensor (Eq. 155). 

Finally, the following identity can also be obtained from the definition of the rank-3 
permutation tensor (Eq. 155) and the use of the ordinary Kronecker delta as an index 
replacement operator (Eq. 172): 


eer get SY rear aD + eee 1 eee pe, ee 
€ijkO; = €ijkOK = €ijk0; = CxjKO%, = €ijk0; = €ijkO; = 0 (197) 


This is because on replacing one of the indices of the permutation tensor it will have two 
identical indices and hence it is zero, e.g. €ijk0; = €)jk = 0. We note that identities like 
Eqs. 196 and 197 also apply to the opposite variance type. Also, Eqs. 196 and 197 apply 
to the permutation tensor of other ranks with some simple modifications. 


4.4 Generalized Kronecker delta Tensor 


The generalized Kronecker delta tensor in an nD space is an absolute rank-2n tensor of 
type (n,n) which is normally defined inductively by: 


1 (j1---Jn is even permutation of i; ...i,) 
eae =<-1 (j1---Jn is odd permutation of i, ...7,) (198) 
0 (repeated 7’s or j’s) 


It can also be defined analytically by the following n x n determinant: 


on Of on 

cg, | 02 16 ake 62 

vite J J2 Jn 

es ee (199) 
on On on 


where the i entries in the determinant are the ordinary Kronecker deltas as defined 


previously (see § 4.1). In this equation, the pattern of the indices in the generalized 
Kronecker delta symbol ans in connection to the indices in the determinant is similar to 
the previous patterns seen in § 4.3.3, that is the upper indices in eee provide the upper 
indices in the ordinary deltas by indexing the rows of the determinant, while the lower 
indices in Ce provide the lower indices in the ordinary deltas by indexing the columns 
of the determinant. 


From the previous identities, it can be shown that: 


ay a4 ay 

On Oi5 Oi 

62 §2 1... §2 

ay ..0 = np J2 In 
CONNER Gh ae ag. oo (200) 

bi dn Lin: 

OF O% Oj, 
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Now, on comparing the last equation with the definition of the generalized Kronecker 
delta, i.e. Eq. 199, we conclude that: 


igeies a Ry 
€ Cpiijn Oj is (201) 


Based on Eq. 201, the generalized Kronecker delta is the result of multiplying two relative 
tensors one of weight w = +1 and the other of weight w = —1 and hence the generalized 
Kronecker delta has a weight of w = 0. This shows that the generalized Kronecker delta is 
an absolute tensor, as stated above. We remark that the multiplication of relative tensors 
produces a tensor whose weight is the sum of the weights of the original tensors (see § 
S143); 

From Eq. 201, we can see that the relation between the rank-n permutation tensor in 
its covariant and contravariant forms and the generalized Kronecker delta in an nD space 
is given by: 


Cian ere es gs Ss (202) 


11...An 


Ge RNS Cite Olen (203) 


where the first of these equations is obtained from Eq. 201 by substituting (1...n) for 
(i; ...%,) in the two sides with relabeling j as 7 and noting that e+" = 1, while the second 
equation is obtained from Eq. 201 by substituting (1...) for (j1...j,) in the two sides 
and noting that €,.., = 1. 

Based on Eqs. 202 and 203, the permutation tensor can be considered as an instance 
of the generalized Kronecker delta. Consequently, the rank-n permutation tensor can be 
written as an n x n determinant consisting of the ordinary Kronecker deltas (Eq. 199). 
Moreover, Eqs. 202 and 203 can provide another definition for the permutation tensor in 
its covariant and contravariant forms, in addition to the previous inductive and analytic 
definitions of this tensor which are given by Eqs. 156 and 160. 

The 3D generalized Kronecker delta may be symbolized by ee If we replace n with 
k; and use the determinantal definition of the generalized Kronecker delta of Eq. 199 
followed by conducting a few basic algebraic manipulations using some of the above tensor 
identities, we obtain: 


a | Oh Om Ok 
jp. =| 6 6, & (Eq. 199) (204) 
of Om Of 


= 6; (67,04 — 525%.) + 54, (SL0f — df 68) + 6), (676%, — 5,67) 
= 6161, 6k — 556 + 5' 526% — 6b 675R + O75" — 6462, 5F 


= 6154, oF — 5154, + 615) — 5¢ OF dk + 61,6) — 6152, (Eq. 173) 
= 36352, — 5162, + 61,6) — 362,67 + 61,5) — 5463, (Eq. 174) 
= 015%, — 5nd 


_|% Sn 
1G oh, 
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= (Eq. 199) 


that is: 

Simk = Sim (205) 
Similar identities can be obtained from contracting two corresponding indices of the nD 
generalized Kronecker delta to obtain (n — 1)D generalized Kronecker delta, e.g. 


Oy 0} (206) 
which can be simply verified following the former example, that is: 


- jt Ot ee sais ; 
Oi; = | 5) i) | = 6/0; — 6,6] = 26, — 0, = 0; (207) 
l J 
Returning to the widely used epsilon-delta identity of Eq. 191, if we consider Eq. 205 
plus the above identities which correlate the permutation tensor, the generalized Kronecker 
delta tensor and the ordinary Kronecker delta tensor, then an identity equivalent to Eq. 


191 that involves only the generalized and ordinary Kronecker deltas can be obtained, that 
- [20] 
is: 


€F emp = OFF, (Eq. 201) (208) 
= 5 (Eq. 205) 
53 6k, 
-| 5! i (Eq. 199) 


= 0154, — nd 
This means that the following relation: 


Simak = Sim = O15hn — Fmd (209) 
which is no more than the definition of the generalized Kronecker delta of Eq. 199 is 
another form of the epsilon-delta identity. The pattern of the indices on the right hand 
side in relation to the indices of the generalized Kronecker delta is very simple, that is 
we take first the corresponding upper and lower indices followed by the diagonally crossed 
indices (i.e. || — x). It is worth noting that the epsilon-delta identity (Eqs. 191 and 209) 
can also be expressed in a more general form by employing the metric tensor with the 
absolute permutation tensor, that is: 


9 €x1€ jn = GemGin — Jun Jim (210) 


Other identities involving the permutation tensor and the ordinary Kronecker delta ten- 
sor can also be formulated in terms of the generalized Kronecker delta tensor. 


°lTn fact, this is a derivation of Eq. 191. 
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4.5 Metric Tensor 


The metric tensor, which may also be called the fundamental tensor, is a rank-2 sym- 
metric absolute non-singular tensor, where “non-singular” means invertible and hence its 
determinant does not vanish at any point in the space. The metric tensor is one of the 
most important special tensors in tensor calculus, if not the most important of all. Its 
versatile usage and functionalities permeate the whole discipline of tensor calculus and 
its applications. One of the main objectives of the metric is to generalize the concept 
of distance to general coordinate systems and hence maintain the invariance of distance 
in different coordinate systems, as will be explained next. This tensor is also used to 
raise and lower indices and thus facilitate the transformation between the covariant and 
contravariant types. As a tensor, the metric has significance regardless of any coordinate 
system although it requires a coordinate system to be represented in a specific form (see § 
2.7). So, in principle the coordinate system and the space metric are independent entities. 

In an orthonormal Cartesian coordinate system of an nD space the length of infinitesimal 
element of arc, ds, connecting two neighboring points in space, one with coordinates x; 
and the other with coordinates x; + dx; (i = 1,--- ,n), is given by: 


This definition of distance is the key to introducing a rank-2 tensor, g,;, called the metric 
tensor which, for a general coordinate system, is defined by: 


(ds)? = g,jdu'dud (212) 


where the indexed u represents general coordinates. The metric tensor in the last equation 
is of covariant form. The metric tensor has also a contravariant form which is notated 
with g’?. It is common to reserve the term “metric tensor” to the covariant form and call 
the contravariant form, which is its inverse, the associate or conjugate or reciprocal metric 
tensor. 

The components of the metric tensor in its covariant and contravariant forms are closely 
related to the basis vectors, that is: 


Qj = E,;-E; (213) 
gi = E'-E (214) 


where the indexed E are the covariant and contravariant basis vectors as defined previously 
(see § 2.6). Because of these relations, the vectors E; and E’ may be denoted by g; and 
g’ respectively which is more suggestive of their relation to the metric tensor. Similarly, 
the mixed type metric tensor is given by: 


gi, =E'-E, =5; g? =E;-E’ = 6/7 (215) 


and hence it is the identity tensor. These equalities, which may be described as the 
reciprocity relations, represent the fact that the covariant and contravariant basis vectors 
are reciprocal sets. 
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As a consequence, the covariant metric tensor is given, in full tensor notation, by: 


Ox* Oax* 
Ee alarm reece! (216) 
Ou’ Oud 
where 
PST Geog) Cee ds escent) (217) 
are independent coordinates in an nD space with a rectangular Cartesian system, and u* 
and uw’ (i,7 =1,...,n) are independent general coordinates. Similarly, for the contravari- 
ant metric tensor we have: ee 
* uw’ Ou 
ij ae (218) 
On On" 


As stated already, the basis vectors, whether covariant or contravariant, in general co- 
ordinate systems are not necessarily mutually orthogonal and hence the metric tensor is 
not diagonal in general since the dot products given by Eqs. 213 and 214 (or Eqs. 216 
and 218) are not necessarily zero when i 4 7. Moreover, since those basis vectors vary in 
general in their magnitude and relative orientations and they are not necessarily of unit 
length, the entries of the metric tensor, including the diagonal elements, are not neces- 
sarily of unit magnitude. Also, the entries of the metric tensor, including the diagonal 
elements, can be positive or negative.?!] However, since the dot product of vectors is 
a commutative operation, the metric tensor is necessarily symmetric. We note that the 
mixed type metric tensor is diagonal (or in fact the unity tensor) because the covariant 
and contravariant basis vector sets are reciprocal systems (see Eq. 215). This applies in 
all coordinate systems. 

As indicated above, the covariant and contravariant forms of the metric tensor are in- 
verses of each other and hence we have the following relations: 


ij) —t ig —1 
[9a] = [9”7] [9] = [9s] (219) 
Hence: 
9 95 = 8; ging’? = 5; (220) 
where these equations can be seen to represent matrix multiplication (rowxcolumn). A 


result that can be obtained from the previous statements plus Eqs. 138, 213 and 214 is 
that: 


E'E, : EYE, = (E'- E’) (Ej-Ex) = g¥ojn = 5%, (221) 
E,E’ : EE’ = (E;-E;) (E)-E*) = gyg%* = 6,4 (222) 


Since the metric tensor has an inverse, it should be non-singular and hence its determinant, 
which in general is a function of coordinates like the metric tensor itself, should not vanish 
at any point in the space, that is: 


g(u',...,u”) = det (qj) 4 0 (223) 


1l The diagonal entries can be negative when the coordinates are imaginary (see § 2.2.3 and Eq. 241). 


4.5 Metric Tensor 93 


From the previous statements, it may be concluded that the metric tensor is in fact 
a transformation of the Kronecker delta tensor in its different variance types from an 
orthonormal Cartesian coordinate system to a general coordinate system, that is (see Eqs. 


73-75): 


95 = woot = BR = EE; (covariant) (224) 


y pa audit T E’ - E/ (contravariant ) (225) 
» Ou dz! Our Ox* ; 


As stated above, the metric tensor is symmetric in its two indices, that is: 
Gig = Qyi gq S go" (227) 


This can be easily explained by the commutativity of the dot product of vectors in reference 
to the above equations involving the dot product of the basis vectors (Eqs. 213 and 214). 
Because of the relations: 


AO = A-E'=4A;E’-E’ = Ajg” (228) 


the metric tensor is used as an operator for raising and lowering indices and hence facili- 
tating the transformation between the covariant and contravariant types of vectors. By a 
similar argument, the above can be generalized where the contravariant metric tensor is 
used for raising covariant indices of covariant and mixed tensors and the covariant metric 
tensor is used for lowering contravariant indices of contravariant and mixed tensors of any 
rank, e.g. 

Al, = g Aj Ae = gy A™ (230) 


Consequently, any tensor in a Riemannian space with well-defined metric can be cast into 
covariant or contravariant or mixed forms where for the mixed form the rank should be 
> 1. We note that in the operations of raising and lowering of indices the metric tensor 
acts, like the Kronecker delta tensor, as an index replacement operator beside its action 
in shifting the index position. 

In this context, it should be emphasized that the order of the raised and lowered indices 
is important and hence: 


g* Aj, = A; and Anz = A’, (231) 


are different in general. A dot may be used to indicate the original position of the shifted 
index and hence the order of the indices is recorded, e.g. A;' and A? ; for the above 
examples respectively. Because raising and lowering of indices is a reversible process, 
keeping a record of the original position of the shifted indices will facilitate the reversal if 
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needed. We note that dots may also be inserted in the symbols of mixed tensors to remove 
any ambiguity about the order of the indices even without the action of the raising and 
lowering operators (refer to § 1.2). 

For a space with a coordinate system in which the metric tensor can be cast into a 
diagonal form with all the diagonal entries being of unity magnitude (i.e. +1) the space 
and the metric are called flat. For example, in 2D manifolds a plane surface is a flat 
space since it can be coordinated by an orthonormal 2D Cartesian system which results 
into a diagonal unity metric tensor since the basis vectors of this system are mutually 
perpendicular constant vectors and each is of unit length. On the other hand, an ellipsoid 
is not a flat space (i.e. it is curved) because due to its intrinsic curvature it is not possible 
to find a coordinate system for this type of surface whose basis vectors produce a diagonal 
metric tensor with constant diagonal elements of unity magnitude. In this context we note 
that a Riemannian metric, g;;, in a particular coordinate system is a Euclidean metric if it 
can be transformed to the identity tensor, 0;;, by a permissible coordinate transformation. 

If g and g are the determinants of the covariant metric tensor in unbarred and barred 
coordinate systems respectively, i.e. g = det (g;;) and g = det (g;;), then we have: 


g= Ig Va=JVG (232) 
where J (= Gul) is the Jacobian of the transformation between the unbarred and barred 
systems. Consequently, the determinant of the covariant metric tensor and its square root 
are relative scalars of weight +2 and +1 respectively (see § 3.1.3). 

A “conjugate” or “associate” tensor of a tensor in a metric space is a tensor obtained 
by inner product multiplication, once or more, of the original tensor by the covariant or 
contravariant forms of the metric tensor. All tensors associated with a particular tensor 
through the metric tensor represent the same tensor but in different base systems since 
the association is no more than raising or lowering indices by the metric tensor which is 
equivalent to a representation of the components of the tensor relative to different basis 
sets. 

A sufficient and necessary condition for the components of the metric tensor to be con- 
stants in a given coordinate system is that the Christoffel symbols of the first or second 
kind vanish identically (refer to 5.1). This may be concluded from Eqs. 307 and 308. The 
metric tensor behaves as a constant with respect to covariant and absolute differentiation 
(see § 5.2 and § 5.3). Hence, in all coordinate systems the covariant and absolute deriva- 
tives of the metric tensor are zero. Accordingly, the covariant and absolute derivative 
operators bypass the metric tensor in differentiating inner and outer products of tensors 
involving the metric tensor. 

In orthogonal coordinate systems in an nD space the metric tensor in its covariant and 
contravariant forms is diagonal with non-vanishing diagonal elements, that is: 


Gj = 9 g =0 (#3) (233) 
Gi FO g £0 (no sum on 2) (234) 
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Moreover, we have: 


1 
Gi = (hy)? = Po (no sum on #) (235) 
det (9:3) = 9 = 911922 --- Inn = I] (hi)? (236) 


a 


=i 
es 1 1 2 
det (g”) =-= = h; Dat 
( ) G 911 922+++ Inn IM ) | ee) 
where h; (= |E;|) are the scale factors, as described previously in § 2.5 and 2.6. 
As indicated before, for orthonormal Cartesian coordinate systems in a 3D space, the 
metric tensor is given in its covariant and contravariant forms by the 3 x 3 unit matrix, 


that is: 


l9j]=[6;)=] 9 1 O | = [6] = [9%] (238) 
0 0 | 


For cylindrical coordinate systems of 3D spaces identified by the coordinates (p, ¢, z), the 
metric tensor is given in its covariant and contravariant forms by: 


1 0 O 1 0 0 
[gi] = 0 p? 0 ie= 0 2 0 (239) 
0 0 1 0 0 1 


while for spherical coordinate systems of 3D spaces identified by the coordinates (r, 6, 0), 
the metric tensor is given in its covariant and contravariant forms by: 


1 0 0 - D0, - 
il =| 9 7? 0 [J=| 0 2 0 (240) 
0 0 r?sin?@ 0 0 si 


As seen in Eqs. 238-240, all these metric tensors are diagonal since all these coordinate 
systems are orthogonal. We also notice that all the corresponding diagonal elements of 
the covariant and contravariant types are reciprocals of each other. This can be easily 
explained by the fact that these two types are inverses of each other, moreover the inverse 
of an invertible diagonal matrix is a diagonal matrix obtained by taking the reciprocal 
of the corresponding diagonal elements of the original matrix, as it is known from linear 
algebra. We also see that the diagonal elements are the squares of the scale factors of 
these systems or their reciprocals (refer to Table 1). 

The Minkowski metric, which is the metric tensor of the 4D space-time of the mechanics 
of Lorentz transformations, is given by one of the following two forms: 


0 O 0 — 


1 1 
PP 0 -1 0 0 if 0 

[94] a [9 | = 0 0 —1 0 [9:4] = Lg | = 0 
0 0 


oO @ =1 


(241) 


OoOrooe 
eH OO O&O 
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Consequently, the length of line element ds can be imaginary (see Eqs. 36 and 212). As 
seen, this metric belongs to a flat space since it is diagonal with all the diagonal entries 
being +1. 

The partial derivatives of the components of the covariant and contravariant metric 
tensor satisfy the following identities: 


On9i; = =F GOK (242) 
ng? = —g'™ Gg" OKnGnm (243) 


A related formula for the partial derivatives of the components of the covariant and con- 
travariant metric tensor is given by: 


JimOng”™ = — 9"! OnGien (244) 


This relation can be obtained by partial differentiation of the relation ging” = 6! (Eq. 
220) with respect to the k’” coordinate using the product rule, that is: 


Ox (Ggimg™) = gimOng”™ + 9°" OnGim = 62=0 = — gimOkg™ =—G" Ongim (245) 


The last step in the differentiation (i.e. O,,52 = 0) is justified by the fact that the compo- 
nents of the Kronecker delta tensor are constants. 

In fact, Eq. 242 can be obtained form Eq. 244 by relabeling m as n and multiplying 
both sides of Eq. 244 with g,,; followed by contraction and exchanging the labels of 7 and 
m, that is: 


g™ OnGim = —JimOng"™ (Eq. 244) (246) 
9" Ongin = —GinOng”™ (m — n) (247) 
Imi I" Ok Gin = —Img Gin Ong” (X9mj) (248) 
5” Ongin = —Jmj GinOng”™ (Eq. 220) (249) 
Oa, = ~9mg JinOng™! (Eq. 172) (250) 

OngGi3 = —ImgGniOngr”™ (m + j) (251) 


Similarly, Eq. 243 can be obtained form Eq. 244 by multiplying both sides of Eq. 244 
with g’” followed by contraction and exchanging the labels of i and n, that is: 


JimOng”™ = 9g One Gim ( ( ) 
GG Od =o Oia: ( (253) 
OmOng”? = —9'" 9" OnGim (Eq. 220) (254) 
deg” = —g'"g"™ On gim ( (255) 

dg”? = —g"™ g On gam ( (256) 
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4.6 Definitions Involving Special Tensors 


In the following subsections, we investigate a number of mathematical objects and oper- 
ations whose definitions and applications are dependent on the above described special 
tensors, particularly the permutation and metric tensors. These are just a few examples 
of tensor definitions involving the special tensors and hence they are not comprehensive 
in any way. 


4.6.1 Dot Product 


The dot product of two basis vectors in general coordinate systems was given earlier in § 
4.5 (see Eqs. 213-215). This will be used in the present subsection to develop expressions 
for the dot product of vectors and tensors in general. 

The dot product of two vectors, A and B, in general coordinate systems using their 
covariant and contravariant forms, as well as opposite forms, is given by: 


A-B = A,E'- BE! = A;B;E'- E! = g A,B; = AB; = A,B" (257) 
A-B = A‘E;: BIE; = A‘ BIE; -E; = 9;;A'B? = A,B! = A'B; (258) 
A-B = AE’. BE, = A,B/E' - E, = 6.A;B) = A;B! (259) 
A-B = A'E;- BjE! = A'B,E,-E! = 6) A'B; = A‘B; (260) 


In brief, the dot product of two vectors is the dot product of their two basis vectors 
multiplied algebraically by the algebraic product of their components. Because the dot 
product of the basis vectors is a metric tensor, the metric tensor will act on the components 
by raising or lowering the index of one component or by replacing the index of a component, 
as seen in the above equations. 

The dot product operations outlined in the previous paragraph can be easily extended 
to tensors of higher ranks where the covariant and contravariant forms of the components 
and basis vectors are treated in a similar manner to the above examples to obtain the dot 
product. For instance, the dot product of a rank-2 tensor of contravariant components A‘! 
and a vector of covariant components B;, is given by: 


A-B = (A%E,E,) - (B,E*) = AY B, (E,E; - E*) = A’ B, Eo} = AY B,E, (261) 
that is, the i” component of this product, which is a contravariant vector, is: 
[A - B]' = AYB, (262) 


From the previous statements, we conclude that the dot product in general coordinate 
systems occurs between two vectors of opposite variance type. Therefore, to obtain the dot 
product of two vectors of the same variance type, one of the vectors should be converted 
to the opposite type by the raising or lowering operator, followed by the inner product 
operation. This can be generalized to the dot product of higher-rank tensors where the 
two contracted indices of the dot product should be of opposite variance type and hence 
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the index shifting operator in the form of the metric tensor should be used, if necessary, 
to achieve this. We note that the generalized dot product of two tensors is invariant under 
permissible coordinate transformations. We also note that the variance type of the tensors 
involved in an inner product operation is irrelevant for orthonormal Cartesian systems, as 
explained before. 


4.6.2 Magnitude of Vector 


The magnitude of a contravariant vector A is given by: 


JA|=VA-A= \ (4B) - (AVE;) = \/(Ei- E;) A‘Ad = git Al = VASAT = V/APA; 

(263) 

where Eqs. 213 and 229 are used. A similar expression can be obtained for the covariant 
form of the vector, that is: 


JA) =VA-A= (AE) -(AjE’) = ,/(Ei- BE?) A;A; = /g% AjAj = VAA; = V AA? 

(264) 

where Eqs. 214 and 228 are used. The magnitude of a vector can also be obtained more 

directly from the dot product of the covariant and contravariant forms of the vector, that 
is: 


JAJ=VA-A= VA’) . (AJE;) = (BE) Adi = [OA AI = VA;Ai = V/A; AI 
( 


265) 


where Eqs. 215 and 172 are used. 


4.6.3. Angle between Vectors 


The angle @ between two contravariant vectors A and B is given by: 


cos 9 = = = 266 
AB] /#E, AE, JE"E BE, /qddyaanpep 7°) 
Similarly, the angle 6 between two covariant vectors A and B is given by: 
A-B A;E' - B,E’ IA,B; 
cost = = J = f J (267) 
For two vectors of opposite variance type we have: 
9 A-B A‘’B; A,B’ (268) 
|A| |B V gn A* A! V g" By, Ba, V gH A.A, V Grin eB” 
All these three cases can be represented by the following formula: 
A‘’B; A;Bi 
cos @ = / (269) 


VA,A'/B,B" VWA,A'V/B,B” 
The angle 6 between two sufficiently smooth space curves, C; and C%, intersecting at a 
given point P in the space is defined as the angle between their tangent vectors, A and 
B, at that point (see Figure 17). 
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Figure 17: The angle between two space curves, C, and C2, as the angle @ between their 
tangents, A and B, at the point of intersection P. 


4.6.4 Cross Product 


The cross product of two covariant basis vectors in general coordinate systems of a 3D 
space is given by: 


Ox! Oxr™ Ox! Ox™ Ox! Ox™ 
= Te] X = Tem = TT El XK Om = RT aT Elmn& 
Ou' Ow" — Au’ Ow ™ Out Oui” 


where the indexed x and u are the coordinates of Cartesian and general coordinate sys- 
tems respectively, the indexed e are the Cartesian basis vectors and €jm, is the rank-3 
permutation relative tensor as defined by Eq. 155. In the last step of the last equation, 
Eq. 184 is used to express the cross product of two orthonormal vectors in tensor notation. 
We note that for orthonormal Cartesian systems, there is no difference between covariant 
and contravariant tensors and hence e; = e’. We also note that for orthonormal Cartesian 
systems g = 1 where g is the determinant of the covariant metric tensor (see Eqs. 238 and 
236). 
Now since e, = e” = or EF, the last equation becomes: 


Ox! Ox™ Ox” 
Bax Ea ona oul Ou 


where the underlined absolute covariant permutation tensor is defined as: 


E* = ¢,,,E" (271) 


Ox! Ox™ Ox” 


&ijk = Fi Ow Duk (272) 


So the final result is: 
E; x Ej = €;;,E" (273) 
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By a similar reasoning, we obtain the following expression for the cross product of two 
contravariant basis vectors in general coordinate systems: 


E! x EY = &)*E, (274) 
where the absolute contravariant permutation tensor is defined as: 


ijk eats ee kee 
Ox! Ox™ Ox” 


i And Ark 
= Ou’ Ou! Ou _imn (275) 


Considering Eq. 166, the above equations can also be expressed as: 


€:j.E* = V9¢ijnE* (276) 


ijk 


Ei; x E; 
€ 


V9 


The cross product of non-basis vectors follows similar rules to those outlined above for 
the basis vectors; the only difference is that the algebraic product of the components is 
used as a scale factor for the cross product of their basis vectors. For example, the cross 
product of two contravariant vectors, A’ and BY’, is given by: 


Exm = ch, = 


EK, (277) 


A x B = (A’E,) x (B’E,;) = A'B! (E; x Ej) = ¢;,4°B’E* (278) 


that is, the k’” component of this product, which is a vector with covariant components, 
is: 


[A x B]), = €;,4°B? (279) 


Similarly, the cross product of two covariant vectors, A; and B,, is given by: 
A x B= (A,E’) x (B,E’) = A;B; (E’ x EB’) = €*A,B,E, (280) 
with the k’” contravariant component being given by: 


[A x B]* = &* A,B; (281) 


4.6.5 Scalar Triple Product 


The scalar triple product of three contravariant vectors in a 3D space is given by: 


A-(B x C) = A‘E; - (B’E; x C*E,) (282) 
= A'BIC*E, - (E; x Ex) 
= A'BIC*E; - (€;E') (Eq. 273) 
= €;,,A'B'C* (E; - E') 
= €;,A'°BIC*6; (Eq. 215) 


= €_A'BIC* (Eq. 172) 
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= €;;,A°BIC* (Eq. 180) 


Similarly, the scalar triple product of three covariant vectors in a 3D space is given by: 


A-(B x C) = AjE'- (BjE’ x C,E*) (283) 
= A,B;C;,E’ - (BE x E*) 
= A,B;C,E' - (e"E;) (Eq. 274) 
=<!" 4, B,C; (E' - Ey) 
= J AB;C;,6, (Eq. 215) 
= 3 AB,C; (Eq. 172) 
= 3" A,B,C; (Eq. 180) 


Other forms can also be obtained by a similar method. 


4.6.6 Vector Triple Product 


The vector triple product of three vectors, one contravariant and two covariant, in a 3D 
space is given by: 
A x (B x C) = A’E; x (B;E’ x C,E*) (284) 
= A'B;C,,E; x (E’ x E*) 
A'B;C,E; x (€!"E,) (Eq. 274) 
= ¢!*' A’ B,C; (E; x E)) 
= J AB C64,” (Eq. 273) 
= eye A‘B,CLE™ 
= €ime A’ B;C,E™ (Eq. 166) 


Following relabeling of indices and writing in a covariant component form, we obtain the 
following more organized expression: 


[A x (B x C)], = ejne™" A BC rn (285) 


We can also obtain a different form from one covariant vector and two contravariant 
vectors, that is: 


A x (B x C) = A;E’ x (BYE; x C*Ex) (286) 
= A;BIC*E! x (E; x Ex) 
= A,BIC*E' x (€;,)E') (Eq. 273) 
= €;,Ai:B'C* (E’ x E’) 
= €5,,4:B°C*e "Em (Eq. 274) 
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ilm i vk 


= 6, A;BIC*Em (Eq. 166) 


Following relabeling of indices and writing in a contravariant component form, we obtain 
the following more organized expression: 


[A x (B x C)]* = €¥ eg A,;B'C™ (287) 
The expressions of the other principal form of the vector triple product, i.e. (A x B) x C, 
can be obtained by a similar method. 
4.6.7 Determinant of Matrix 


For a 3 x 3 matrix representing a rank-2 tensor A of mixed form in a 3D space, the 
determinant is given by: 


At Ab Al 
veers ee sia 
det (A) =| A? A2 A2 | = <e%*e,,,,A! LAP AD = Oe ALAM AD (288) 
1 2 3 


where Eq. 201 is used in the last step. This may be expressed as a row expansion by 
substituting 1,2,3 for 1,m,n that is: 


det (A) =e" Ai A? AR (289) 
It can also be expressed as a column expansion by substituting 1, 2,3 for 2,7, k that is: 
det (A) = €tmnAj An AR (290) 


More generally, for an n x n matrix representing a rank-2 tensor A of mixed form in an 
nD space, the determinant is given by: 


ih, 
det (A) = ae Bees gj Alt ae (291) 
n} 


Similar definitions can be given using the covariant and contravariant forms of the tensor 
with the employment of the opposite variance type of the permutation tensors. 


4.6.8 Length 


The differential of displacement vector in ee coordinate systems is given by: 


Or 


ar By 


= E,du' = 


lg ae = 5—|E,| Ejdu' (292) 


where r is the position vector as defined oe and the hat indicates a normalized 
vector. The length of line element, ds, which may also be called the differential of arc 
length, in general coordinate systems is given by: 


(ds) = dr - dr = E,du' - Ejdu) = (E,- E,) du'du? = gjjdu'du’ (293) 
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where gj; is the covariant metric tensor. 
For orthogonal coordinate systems, the metric tensor is given by: 


__ fo ea) 
a {i =3) a 


where h, is the scale factor of the i” coordinate. Hence, the last part of Eq. 293 becomes: 


(ds) = $0 (hy)* du'du! (295) 


a 


with no cross terms (i.e. terms of products involving more than one coordinate like du’du/ 
where i # 7) which are generally present in the case of non-orthogonal coordinate systems. 

On conducting a transformation from one coordinate system to another coordinate sys- 
tem, where the other system is marked with barred coordinates wu, the length of line element 
ds will be expressed in the new system as: 


(ds)? = Gi,di'da! (296) 


Since the length of line element is an invariant quantity, the same symbol (ds)? is used in 
both Eqs. 293 and 296. 

Based on the above formulations, the length L of a t-parameterized space curve C(t) 
defined by u’ = u'(t) where i = 1,--+ ,n, which represents the distance traversed along the 
curve on moving between its start point P; and end point Ps, is given in general coordinate 
systems of an nD space by: 


me — du’ dud 
Cc Pi ty 


where C’ represents the space curve, t is a scalar real variable, and t, and tz are the values 
of t corresponding to the start and end points respectively. It is noteworthy that some 
authors add a sign indicator to ensure that the argument of the square root in the above 
equation is positive. However, as indicated in § 1.1, such a condition is assumed when 
needed since in this book we deal with real quantities only. 


4.6.9 Area 


In general coordinate systems of a 3D space, the area of an infinitesimal element of the 
surface ut = c; in the neighborhood of a given point P, where c is a constant, is obtained 
by taking the magnitude of the cross product of the differentials of the displacement vectors 
in the directions of the other two coordinates on that surface at P, i.e. the tangent vectors 
to the other two coordinate curves at P (see Figure 18). Hence, the area of a differential 
element on the surface ut = c, is given by: 


do(u* = c,) = |dr2 x drs| (298) 
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Or Or 


=l5,2% a8 du*du® 

= |E, x E3| du?du® (Eq. 45) 

= €53E' | du*du? (Eq. 273) 

= |é31| |E"| du*du’ 

= /gVE! - E! du?du? (Eqs. 166 & 263) 


= /9 Vg" du?du? (Eq. 214) 
= gg"! du*du® 


where o represents area and dr2 and dr3 are the displacement differentials along the second 
and third coordinate curves at P while the other symbols are as defined previously. 


ur 


Figure 18: The area do of an infinitesimal element of the surface ut = c; in the neighbor- 


hood of a given point P as the magnitude of the cross product of the differentials of the 
displacement vectors in the directions of the other two coordinate curves on that surface 
at P, dr and drs. 


On generalizing the above argument, the area of a differential element on the surface 
u' = c; (i = 1,2,3) in a 3D space where c; is a constant is given by: 


do(u' = &) = Vggtduidu® (i #7 #k, no sum on 2) (299) 


In orthogonal coordinate systems in a 3D space we have: 


(ri)? 
where Eqs. 235 and 236 are used. Hence, Eq. 299 becomes: 


Jog = |[oorenton es hjh (i #7 #k, no sum on any index) (300) 


do(u’ = c;) = hjhydutdu® (i 4j #k, no sum on any index) (301) 
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The last formula represents the area of a surface element of a rectangular shape with sides 
h;dud and hydu* (no sum on j or k). 
Based on the above discussion, the area A of a finite surface patch can be defined by the 


following formula: 
A= I do (302) 
Ss 


where S represents a surface patch and do is the area differential of an infinitesimal 
element on the patch. For coordinate surfaces, the expression for do can be obtained from 
the previous formulations. 


4.6.10 Volume 


In general coordinate systems of a 3D space, the volume of an infinitesimal element of 
a solid body occupying a given region of the space in the neighborhood of a given point 
P, where the element is represented by a parallelepiped defined by the three differentials 
of the displacement vectors dr; (i = 1,2,3) along the three coordinate curves at P, is 
obtained by taking the magnitude of the scalar triple product of these vectors (refer to § 
4.6.5 and see Figure 19). Hence, the volume of a differential element of the body is given 
by: 


dr = |dr, - (dra x dr3)| (303) 
Or Or Or At acta 
haat ($5 x | du’ du*du 
= E, : (E, x Es)| du'du*du® (Eq. 45) 
= |Ey - 6)3,E"| du'du?du® (Eq. 273) 
= |E, - E'| |€o3,| du'du?du® 
= by | |€o31| dutdu?du® (Eq. 215) 
= /gdu'du7du? (Eq. 166) 
= J du du*du q. 63 
J du'du?du® E 


where g is the determinant of the covariant metric tensor g;;, and J is the Jacobian of 
the transformation as defined previously (see § 2.3). We note that due to the freedom 
of choice of the order of the variables, which is related to the choice of the coordinate 
system handedness that could affect the sign of the determinant Jacobian, the sign of the 
determinant should be adjusted if necessary to have a proper sign for the volume element 
as a positive quantity. The last line in the last equation is particularly useful for changing 
the variables in multivariate integrals where the Jacobian facilitates the transformation. 
In orthogonal coordinate systems in a 3D space, the above formulation becomes: 


dr = hihzh3 du'du?du® (304) 


where h,, hy and hz are the scale factors and where Eq. 236 is used. Geometrically, the 
last formula represents the volume of a rectangular parallelepiped with edges h,du+, hgdu? 
and h3du’. 
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Figure 19: The volume of an infinitesimal element of a solid body in a 3D space in the 
neighborhood of a given point P as the magnitude of the scalar triple product of the 
differentials of the displacement vectors in the directions of the three coordinate curves at 
P, dr,, drg and drs. 


The formulae of Eq. 303, which are specific to a 3D space, can be extended to the 
differential of a generalized volume element in general coordinate systems of an nD space 
as follows: 


dr =./gdu...du™ = Jdu'...du” (305) 


We note that generalized volume elements are used, for instance, to represent the change 
of variables in multi-variable integrations. 
Based on the above discussion, the volume V of a solid body occupying a finite region 


of the space can be defined as: 
V= II dr (306) 
Q 


where 2 represents the region of the space occupied by the solid body and dr is the volume 
differential of an infinitesimal element of the body, and where the expression for dr should 
be obtained from the above formulations. 
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4.1 


4.2 


4.3 


4.4 


4.5 


4.6 


4.7 


4.8 


4.9 


4.10 


4.11 


4.12 


4.13 


4.14 


4.15 


4.16 


4.17 


4.18 


4.19 


4.20 
4.21 


4.22 


What is special about the Kronecker delta, the permutation and the metric tensors 
and why they deserve special attention? 

Give detailed definition, in words and in symbols, of the ordinary Kronecker delta 
tensor in an nD space. 

List and discuss all the main characteristics (e.g. symmetry) of the ordinary Kronecker 
delta tensor. 

Write the matrix that represents the ordinary Kronecker delta tensor in a 4D space. 
Do we violate the rules of tensor indices when we write: ,; = 6% = 64, = 63? 
Explain the following statement: “The ordinary Kronecker delta tensor is conserved 
under all proper and improper coordinate transformations”. What is the relation 
between this and the property of isotropy of this tensor? 

List and discuss all the main characteristics (e.g. anti-symmetry) of the permutation 
tensor. 

What are the other names used to label the permutation tensor? 

Why the rank and the dimension of the permutation tensor are the same? Accordingly, 
what is the number of components of the rank-2, rank-3 and rank-4 permutation 
tensors? 

Why the permutation tensor of any rank has only one independent non-vanishing 
component? 

Prove that the rank-n permutation tensor possesses n! non-zero components. 

Why the permutation tensor is totally anti-symmetric? 

Give the inductive mathematical definition of the components of the permutation 
tensor of rank-n. 

State the most simple analytical mathematical definition of the components of the 
permutation tensor of rank-n. 

Make a sketch of the array representing the rank-3 permutation tensor where the 
nodes of the array are marked with the symbols and values of the components of this 
tensor. 

Define, mathematically, the rank-n covariant and contravariant absolute permutation 
tensors, €,,;, and e. 

Show that €;;, is a relative tensor of weight —1 and ¢’!* is a relative tensor of weight 
a ale 

Show that ¢,,, = /9 €..i, and e"" = moo are absolute tensors assuming that 


€;,..i, i8 a relative tensor of weight —1 and ¢'!~” is a relative tensor of weight +1. 
Write ¢2"" €;,;,...;,, in its determinantal form in terms of the ordinary Kronecker 
delta. 

Prove the following identity: «2° €;,5..4, = nl. 

State a mathematical relation representing the use of the ordinary Kronecker delta 
tensor acting as an index replacement operator. 

Prove the following relation inductively by writing it in an expanded form in a 3D 
space: 6} = n. 
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4.23 
4,24 
4.25 


4.26 
4.27 


4.28 
4.29 


4.30 


4.31 


4.32 


4.33 


4.34 
4.35 


4.36 
4.37 
4.38 
4.39 


4.40 


4.41 
4.42 


4.43 


4.44 


Repeat exercise 4.22 with the relation: u'; = Oy using a matrix form. 
Justify the following relation assuming an orthonormal Cartesian system: O;7; = 0;2;. 
Justify the following relations where the indexed e are orthonormal vectors: 


e,-e; = Oj E;e; > C,e, = dikOjl 


Show that 6/5*6}, = n. 

Write the determinantal array form of ¢’7€,; outlining the pattern of the tensor indices 
in their relation to the indices of the rows and columns of the determinant array. 
Prove the following identity using a truth table: ee, = 6.5) — 615. 

Prove the following identities justifying each step in your proofs: 


il st ijk sisi i 
€ €k = Op €” Elmk = 0102, ras Ops, 
Prove the following identities using other more general identities: 
e* ersy = 20; a AN = 6 


Outline the similarities and differences between the ordinary Kronecker delta tensor 
and the generalized Kronecker delta tensor. 

Give the inductive mathematical definition of the generalized Kronecker delta tensor 
on _ 
Write the determinantal array form of the generalized Kronecker delta tensor 6j)"" 
in terms of the ordinary Kronecker delta tensor. 


Define €;,..;, and ¢*" in terms of the generalized Kronecker delta tensor. 


Prove the relation: €* €imn = ee using an analytic or an inductive or a truth table 
method. 
Demonstrate that the generalized Kronecker delta is an absolute tensor. 


Prove the following relation justifying each step in your proof: 6,777 = df7". 

Prove the common form of the epsilon-delta identity. 

Prove the following generalization of the epsilon-delta identity: g” GixtEjmn = GkmGin — 
GknJim- 

List and discuss all the main characteristics (e.g. symmetry) of the metric tensor. 
How many types the metric tensor has? 

Investigate the relation of the metric tensor of a given space to the coordinate systems 
of the space as well as its relation to the space itself by comparing the characteristics 
of the metric in different coordinate systems of the space such as being diagonal or 
not or having constant or variable components and so on. Hence, assess the status of 
the metric as a property of the space but with a form determined by the adopted co- 
ordinate system to describe the space and hence it is also a property of the coordinate 
system in this sense. 

What is the relation between the covariant metric tensor and the length of an in- 
finitesimal element of arc ds in a general coordinate system? 

How the relation in question 4.43 will become (a) in an orthogonal coordinate system 
and (b) in an orthonormal Cartesian coordinate system? 
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4.45 


4.46 


4.47 


4.48 


4.49 


4.50 


4.51 


4.52 


4.53 
4.54 


4.55 


4.56 


4.57 


4.58 
4.59 


4.60 


4.61 


4.62 


4.63 


4.64 
4.65 


What is the characteristic feature of the metric tensor in orthogonal coordinate sys- 
tems? 

Write the mathematical expressions for the components of the covariant, contravariant 
and mixed forms of the metric tensor in terms of the covariant and contravariant basis 
vectors, E; and E’. 

Write, in full tensor notation, the mathematical expressions for the components of the 
covariant and contravariant forms of the metric tensor, g;; and g’’. 

What is the relation between the mixed form of the metric tensor and the ordinary 
Kronecker delta tensor? 

Explain why the metric tensor is not necessarily diagonal in general coordinate systems 
but it is necessarily symmetric. 

Explain why the diagonal elements of the metric tensor in general coordinate systems 
are not necessarily of unit magnitude or positive but they are necessarily non-zero. 
Explain why the mixed type metric tensor in any coordinate system is diagonal or in 
fact it is the unity tensor. 

Show that the covariant and contravariant forms of the metric tensor, g;; and g’’, are 
inverses of each other. 

Why the determinant of the metric tensor should not vanish at any point in the space? 
If the determinant of the covariant metric tensor g,;; is g, what is the determinant of 
the contravariant metric tensor g’? 

Show that the metric tensor can be regarded as a transformation of the ordinary 
Kronecker delta tensor in its different variance types from an orthonormal Cartesian 
coordinate system to a general coordinate system. 

Justify the use of the metric tensor as an index shifting operator using a mathematical 
argument. 

Carry out the following index shifting operations recording the order of the indices: 


9 Cres Gmn B's Gg Dee 


What is the difference between the three operations in question 4.57? 

Why the order of the raised and lowered indices is important and hence it should be 
recorded? Mention one form of notation used to record the order of the indices. 
What is the condition that should be satisfied by the metric tensor of a flat space? 
Give common examples of flat and curved spaces. 

Considering a coordinate transformation, what is the relation between the deter- 
minants of the covariant metric tensor in the original and transformed coordinate 
systems, g and g? 

B is a “conjugate” or “associate” tensor of tensor A. What this means? 

Complete and justify the following statement: “The components of the metric tensor 
are constants iff ...etc.”. 

What are the covariant and absolute derivatives of the metric tensor? 

Assuming an orthogonal coordinate system of an nD space, complete the following 
equations where the indexed g represents the metric tensor or its components, i 4 7 
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4.66 


4.67 


4.68 


4.69 


4.70 


4.71 
4.72 
4.73 


4.74 
4.75 


4.76 


4.77 


4.78 


4.79 


4.80 


4.81 


4.82 


in the second equation and there is no sum in the third equation: 


Write, in matrix form, the covariant and contravariant metric tensor for orthonormal 
Cartesian, cylindrical and spherical coordinate systems. What distinguishes all these 
matrices? Explain and justify. 

Referring to question 4.66, what is the relation between the diagonal elements of these 
matrices and the scale factors h; of the coordinates of these systems? 

Considering the Minkowski metric, is the space of the mechanics of Lorentz transfor- 
mations flat or curved? Is it homogeneous or not? What effect this can have on the 
length of element of arc ds? 

Derive the following identities: 


mj 


JimOnG — 9"! Onion OnGi = —Imj IniOngr” 


What is the dot product of A and B where A is a rank-2 covariant tensor and B is a 
contravariant vector? Write this operation in steps providing full justification of each 
step. 

Derive an expression for the magnitude of a vector A when A is covariant and when 
A is contravariant. 

Derive an expression for the cosine of the angle 6 between two covariant vectors, A 
and B, and between two contravariant vectors C and D. 

What is the meaning of the angle between two intersecting smooth curves? 

What is the cross product of A and B where these are covariant vectors? 

Complete the following equations assuming a general coordinate system of a 3D space: 


E; x E; =? E’ x E/ =? 


Define the operations of scalar triple product and vector triple product of vectors 
using tensor language and assuming a general coordinate system of a 3D space. 
What is the relation between the relative and absolute permutation tensors in their 
covariant and contravariant forms? 

Define the determinant of a matrix B in tensor notation assuming a general coordinate 
system of a 3D space. 

Derive the relation for the length of line element in general coordinate systems: (ds)° = 
gijdu'du’?. How will this relation become when the coordinate system is orthogonal? 
Justify your answer. 

Write the integral representing the length L of a t-parameterized space curve in terms 
of the metric tensor. 

Using Eq. 295 plus the scale factors of Table 1, develop expressions for ds in orthonor- 
mal Cartesian, cylindrical and spherical coordinate systems. 

Derive the following formula for the area of a differential element on the coordinate 
surface u' = constant in a 3D space assuming a general coordinate system: 


do(u' = constant) = \/gg"dui du" (i A 7 #k, no sum on 2) 
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4.83 


4.84 


4.85 


4.86 


4.87 


How will this relation become when the coordinate system is orthogonal? 

Using Eq. 301 plus the scale factors of Table 1, develop expressions for do on the 
coordinate surfaces in orthonormal Cartesian, cylindrical and spherical coordinate 
systems. 

Derive the following formula for the volume of a differential element of a solid body 
in a 3D space assuming a general coordinate system: 


dr = \/gdu'du*du® 


How will this relation become when the coordinate system is orthogonal? 

Make a plot representing the volume of an infinitesimal element of a solid body in a 
3D space as the magnitude of a scalar triple product of three vectors. 

Use the expression of the volume element in general coordinate systems of nD spaces 
to find the formula for the volume element in orthogonal coordinate systems. 

Using Eq. 304 plus the scale factors of Table 1, develop expressions for dr in or- 
thonormal Cartesian, cylindrical and spherical coordinate systems. 


Chapter 5 
Tensor Differentiation 


Ordinary differentiation rules of partial and total derivatives do not satisfy the principle 
of invariance when they are applied to tensors in general coordinate systems due to the 
fact that the basis vectors in these systems are coordinate dependent. This means that 
the ordinary differentiation of non-scalar tensor components in general coordinates does 
not necessarily results in a tensor. Therefore, special types of differentiation should be 
defined so that when they apply to tensors they produce tensors. The essence of these 
operations is to extend the differentiation process to the basis vectors to which the tensor 
is referred and not only on the tensor components. 

The focus of this chapter is the operations of covariant and absolute differentiation which 
are closely linked. These operations represent generalization of ordinary differentiation in 
general coordinate systems with an objective of making the derivative of tensors comply 
with the principle of tensor invariance. Briefly, the differential change of a tensor in 
general coordinate systems is the result of a change in the basis vectors and a change 
in the tensor components. Hence, covariant and absolute differentiation, in place of the 
normal differentiation (i.e. partial and total differentiation respectively), are defined and 
employed to account for both of these changes by differentiating the basis vectors as well 
as the components of the tensors. 

Since the Christoffel symbols are crucial in the formulation and application of covariant 
and absolute differentiation, the first section of the present chapter is dedicated to these 
symbols and their properties. The subsequent two sections will then focus on the covariant 
differentiation and the absolute differentiation. 


5.1 Christoffel Symbols 


We start by investigating the main properties of the Christoffel symbols which play crucial 
roles in tensor calculus in general and are needed for the subsequent development of the 
upcoming sections since they enter in the definition of covariant and absolute differenti- 
ation. The Christoffel symbols are classified as those of the first kind and those of the 
second kind. These two kinds are linked through the index raising and lowering operators 
although this does not mean they are general tensors (see next). Both kinds of Christoffel 
symbols are variable functions of coordinates since they depend in their definition on the 
metric tensor which is coordinate dependent in general. 

The Christoffel symbols of the first and second kind are not general tensors although 
they are affine tensors of rank-3. We note that affine tensors (see § 3.1.6) are tensors that 
correspond to admissible linear coordinate transformations from an original rectangular 
system of coordinates. As a consequence of not being tensors, if all the Christoffel symbols 
of either kind vanish in a particular coordinate system they will not necessarily vanish in 
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other coordinate systems (see § 3.1.4). For instance, all the Christoffel symbols of both 
kinds vanish in Cartesian coordinate systems but not in cylindrical or spherical coordinate 
systems, as will be established later in this section. 

The Christoffel symbols of the first kind are defined as: 


ee 1 

lj, t] = 5 (Ojgu + Oiggr — Ogi;) (307) 
where the indexed g is the covariant form of the metric tensor. The Christoffel symbols of 
the second kind are obtained by raising the third index of the Christoffel symbols of the 
first kind, and hence they are given by: 


kl 
eS g 
Py = 9" Ti. = = Giga + Aga — 9%) ON) 


where the indexed g is the metric tensor in its contravariant and covariant forms with 
implied summation over |. Similarly, the Christoffel symbols of the first kind can be 
obtained from the Christoffel symbols of the second kind by reversing the above process 
through lowering the upper index, that is: 


where Eqs. 308, 220 and 172 are used. 
The Christoffel symbols of the first and second kind are symmetric in their paired indices, 
that is: 


j,k] = [jt,k] (310) 
Le (311) 


These properties can be verified by shifting the indices in the definitions of the Christoffel 
symbols, as given by Eqs. 307 and 308, noting that the metric tensor is symmetric in its 
two indices (see § 4.5). 

For an nD space with n covariant basis vectors (E), E2,...,E,,) spanning the space, the 
partial derivative 0,;E; for any given 7 and j is a vector within the space and hence it is in 
general a linear combination of all the basis vectors. The Christoffel symbols of the second 
kind are the components of this linear combination, that is: 


0;B; = TE, (312) 
Similarly, for the contravariant basis vectors (E1,E?,...,E”) we have: 
6,E* = -T,,E* (313) 


By inner product multiplication of Eq. 312 with E* and Eq. 313 with E, we obtain: 


E*.0,E; = +°¢E*-E, =+rhof = +k (314) 
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E,-0;E' = —Ty,E,-E* = -Ti,,6f = -Ty, (315) 


where Eqs. 49 and 172 are used. These equations reveal that the Christoffel symbols of 
the second kind are the projections of the partial derivative of the basis vectors in the 
direction of the basis vectors of the opposite variance type. From Eq. 314 and by using 
the index lowering operator, we obtain a similar relation that links the Christoffel symbols 
of the first kind to the basis vectors and their partial derivatives, that is: 


where Eq. 309 is used in the last step. This equation reveals that the Christoffel symbols 
of the first kind are the projections of the partial derivative of the covariant basis vectors 
in the direction of the basis vectors of the same variance type. 

The partial derivative of the components of the covariant metric tensor and the Christoffel 
symbols of the first kind satisfy the following identity: 


Ongiy = [tk, J] + (9k, 7] (317) 


This relation can be obtained from the partial derivative of the dot product of the basis 
vectors with the use of Eq. 316, that is: 


= (0,E;) -E; + E; - (0,E;) (product rule) 
= [ik, j] + kd (Eq. 316) 


We note that Eq. 317 is closely linked to the upcoming Ricci theorem (see § 5.2), that is: 

Gij:k = 9:5 —GajV ip—Gial jy, = Angij— lth, J]-Lk, 7] =0 <=> Apgij = [tk, jJ+[7k, @] (319) 
The relation given by Eq. 317 can also be written in terms of the Christoffel symbols of 
the second kind using the index shifting operator, that is: 


Ongig = Gag Vin + Gail Gy (320) 


where Eq. 309 is used. 
Following the method of derivation given in Eq. 318, we obtain the following relation 
for the partial derivative of the components of the contravariant metric tensor: 


Ong! = Oy (E’ - E’) (Eq. 214) (321) 
= (0,E’) - E’ + E’- (0,E’) (product rule) 
= (-I,,E*) _E’ +E’. (-1!,E’) (Eq. 313) 


= -[',E*-E —T!,E'- E* 
SHO ga OTs (Eq. 214) 
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that is: - _ — 
ang! = 9%, — 9%, (322) 
Like Eq. 317 whose close link to the Ricci theorem is shown in Eq. 319, Eq. 322 can 
also be seen from this perspective, that is: 
9 = Ong? + age + aes =0 = Ag" = —9"T%, = Gals (323) 
The Christoffel symbols of the second kind with two identical indices of opposite variance 
type satisfy the following relations: 
pees 25 din ) =9;(In Vg) = t VSG (324) 
= Dee = oe g) = % He gee 


where the main relation can be derived as follows: 


i= + (Ojgia + Digi — 19%) (Eq. 308 with k = 7) (325) 
- a (Ogi; + O:gj1 — Agi;) (relabeling dummy j,/ in 1% term & g” = g'’) 
= 5 "Bay 
= 50" Os 
= co g (derivative of determinant) 


All the other forms of Eq. 324 can be obtained from the derived form by simple algebraic 
manipulations with the use of the rules of differentiation and natural logarithms. 

In orthogonal coordinate systems, the Christoffel symbols of the first kind can be clas- 
sified into three main groups, considering the identicality and difference of their indices, 
which are given by: 


Rett F8| = 5008 (no sum on 7) (326) 
ig = -5 Osi (i A J, no sum on i) (327) 
ij, k] = 0 (i ASK) (328) 


These three equations can be obtained directly from Eq. 307 with proper labeling of the 
indices (i.e. according to the labeling of the left hand side of Eqs. 326-328) noting that for 
orthogonal coordinate systems g;; = 0 when i 4 j. For example, Eq. 327 can be obtained 
as follows: 


1 
[ii, J] = 9 (O:9i3 + 191g — O;9%1) (Eq. 307 with 7 + 7 and 1 — j) (329) 
1 


5 (0+ 0 — O;9%«) (gi; = 9) 
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1 
= "o 4 Gti 


In fact, Eq. 326 can be obtained from Eq. 317 by exchanging 7 and k then replacing k 
with 2 followed by a simple algebraic manipulation. The middle equality of Eq. 326 is 
based on the fact that the Christoffel symbols are symmetric in their paired indices (Eq. 
310). We note that Eq. 326 includes the case of j = 7, ie. when all the three indices are 
identical, and hence |i, i] = 5 Oi Gis: For this reason, we did not add the condition i 4 j to 
Eq. 326. 

Returning to the above indicated consideration of the identicality and difference of the 
indices of the Christoffel symbols of the first kind, we have 4 main cases: (1) all the indices 
are identical, (2) only two non-paired indices are identical, (3) only the two paired indices 
are identical, and (4) all the indices are different. Eq. 326 represents case 1 and case 2, 
Eq. 327 represents case 3, and Eq. 328 represents case 4. This classification similarly 
applies to the Christoffel symbols of the second kind as represented by the upcoming Eqs. 
331-333. 

In orthogonal coordinate systems where g;; = g'! = 0 (i 4 j), the Christoffel symbols of 
the second kind are given by: 


ri, = [jk,@ (Eq. 308) (330) 
se, = (Eq. 235) 


with no sum on 7. Hence, from Eqs. 326-328 and Eq. 330 the Christoffel symbols of the 
second kind in orthogonal coordinate systems are given by: 


’ a 1 1 
Ra ee 5 Hig = Dg i = 52 In gii (no sum on 7) (331) 
j ge l et i 
LS F598 = ~ 9g, 019% (i ~AJj,nosumoniorj) (332) 
55 
ri, =0 (Aj #K) (333) 


where in the last step of Eq. 331 the well known rule of differentiating the natural logarithm 
is used. Like the Christoffel symbols of the first kind (refer to Eq. 326), Eq. 331 also 
includes the case of j = 7, i.e. when all the three indices are identical, that is Tt, = Bq OiGii- 

In orthogonal coordinate systems of a 3D space, the Christoffel symbols of both kinds 
vanish when the indices are all different, as seen above (Eqs. 328 and 333). Hence, out 
of a total of 27 symbols, representing all the possible permutations of the three indices 
including the repetitive ones, only 21 non-identically vanishing symbols are left since the six 
non-repetitive permutations are dropped. If we now consider that the Christoffel symbols 
are symmetric in their paired indices (Eqs. 310 and 311), then we are left with only 15 
independent non-identically vanishing symbols since six other permutations representing 
these symmetric exchanges are also dropped because they are not independent. 
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Accordingly, in orthogonal coordinate systems in a 3D space the 15 independent non- 
identically vanishing Christoffel symbols of the first kind are: 


aS his | Fain 11,3] = —hyhig 
ail = Shes 129) = "Shisha: 13,4) Sehthis 
13,3] = +h3h3,1 99,1) =—hshai 22,2) = +hyho» (334) 
22,3] = —hohog 23,2] = +hoho3 23,3] = +h3h32 
33, 1] = —A3h3,1 33, 2] = —h3h3.2 33,3] = +h3h3.3 


where the indices 1,2,3 stand for the three coordinates, h,,h2,h3 are the scale factors 
corresponding to these coordinates as defined previously (see § 2.5 and 2.6), and the 
comma indicates, as always, partial derivative with respect to the coordinate represented 
by the following index. For example, in cylindrical coordinates given by (p, ¢, z), h2,1 means 
the partial derivative of hz with respect to the first coordinate and hence ha; = 0,p = 1 
since hg = p (refer to Table 1) and the first coordinate is p. Similarly, in spherical 
coordinates given by (r,6,@), h3.2 means the partial derivative of hz with respect to the 
second coordinate and hence h3 = Og (rsin@) = rcos@ since hz = rsin@ (refer to Table 
1) and the second coordinate is . As indicated above, because the Christoffel symbols of 
the first kind are symmetric in their first two indices, the expression of the [21,1] symbol 
for instance can be obtained from the expression of the [12,1] symbol. 

The expressions given in Eq. 334 for the Christoffel symbols of the first kind in orthogonal 
coordinate systems are no more than simple applications of Eqs. 326 and 327 plus Eq. 
235. For example, the entry [12,1] can be obtained as follows: 


1 
12,1] = $dbon (Eq. 326) (335) 
a 
= 502 (hi)? (Eq. 235) 
= h, Ooh, (rules of differentiation) 
= hyhy2 (notation) 


Similarly, in orthogonal coordinate systems in a 3D space the 15 independent non- 
identically vanishing Christoffel symbols of the second kind are: 


ps ma p2, — — Paha. Be 2, hihi 
11 hi 11 Cha)2 11 (ha)? 
hi h hig 
Tig = abs Ty = +53 Ce — 
h h 
3. 3,1 1 hehe, Ot 2,2 
Ps = aos D5o = po id Po5 = Tips. (336) 
hho ho: h3.2 
Dy = — (hg) 33 = as T>3 = ee 
Tr = —Ashsa De as __ h3ha,e Pr can __ fas 
33 (hy)? 33 (hz)? 33 he 
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where the symbols are as explained above. Again, since the Christoffel symbols of the 
second kind are symmetric in their lower indices, the non-vanishing entries which are not 
listed above can be obtained from the given entries by permuting the lower indices. The 
relations of Eq. 336 can be obtained from the relations of Eq. 334 by dividing by (hj)? 
where 7 is the third index of the Christoffel symbol of the first kind. This can be justified 
by Eqs. 330 and 235. The relations of Eq. 336 can also be obtained directly from Eqs. 
331 and 332 plus Eq. 235, as done for the Christoffel symbols of the first kind where Eqs. 
326 and 327 were used. 

In any coordinate system, all the Christoffel symbols of the first and second kind vanish 
identically ff all the components of the metric tensor in the given coordinate system are 
constants. This can be seen from the definitions of the Christoffel symbols, as given by 
Eqs. 307 and 308, since the partial derivatives will vanish in this case. In affine coordinate 
systems all the components of the metric tensor are constants and hence all the Christoffel 
symbols of both kinds vanish identically. The prominent example is the orthonormal 
Cartesian coordinate systems where all the Christoffel symbols of the first and second 
kind are identically zero. This can also be seen from Eqs. 334 and 336 since the scale 
factors are constants (refer to Table 1). 

In cylindrical coordinate systems, identified by the coordinates (p, ¢, z), all the Christoffel 
symbols of the first kind are zero except: 


[22,1] = -p (337) 
[12,2] = [21,2] = p (338) 


where the indices 1,2 stand for the coordinates p,@ respectively. Also, in cylindrical 
coordinate systems the non-zero Christoffel symbols of the second kind are given by: 


rh = -p (339) 
1 
Tig = T 5, _ p (340) 
where the symbols are as explained above. Again, these results can be obtained from Eqs. 
334 and 336 using the scale factors of Table 1. 
In spherical coordinate systems, identified by the coordinates (r,6,¢), the non-zero 
Christoffel symbols of the first kind are given by: 


22,1] = <r (341) 
33,1] = —rsin?@ (342) 
1959); SS 29) OF (343) 
33,2] = —r? sin @cos@ (344) 
13,3):= (s13)= rain? ? (345) 
23,3] = [32,3]}= r*sin@cosé (346) 


where the indices 1,2,3 stand for the coordinates r,6,@ respectively. Also, in spherical 
coordinate systems the non-zero Christoffel symbols of the second kind are given by: 


cy = (347) 
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T33 = —rsin?é (348) 
if 

P= De = = (349) 

[3, = —sin@cosd (350) 
1 

eS (351) 

Peo y=: cord (352) 


where the symbols are as explained above. As before, these results can be obtained from 
Eqs. 334 and 336 using the scale factors of Table 1. As seen earlier (refer to § 2.2.2), 
all these coordinate systems (i.e. orthonormal Cartesian, cylindrical and spherical) are 
orthogonal systems, and hence Eqs. 334 and 336 do apply. 

Because there is an element of arbitrariness in the choice of the order of coordinates, 
and hence the order of their indices, the Christoffel symbols may be given in terms of 
coordinate symbols rather than their indices to be more explicit and to avoid ambiguity 
and confusion. For instance, in the above examples of the cylindrical coordinate systems 
identified by the coordinates (p,¢,z) we may use [@¢, p] instead of [22,1] and use 1B 
instead of [’?,. Similarly, for the spherical coordinate systems identified by the coordinates 
(r,@,¢) we may use [06,r] instead of [22,1] and use ine instead of T'?,. 

The Christoffel symbols of both kinds may also be subscripted or superscripted by the 
symbol of the metric tensor of the given space (e.g. |iJ, ki], and ”” T%,) to reveal the 
metric which the symbols are based upon. This is especially important when we have 
two or more different metrics related to two or more different spaces as it is the case, for 
instance, in differential geometry of 2D surfaces embedded in a 3D space where we have 
one metric for the surface and another metric for the 3D space. Alternatively, other means 
of distinction may be used such as using Latin or upper case indices for the Christoffel 
symbols of one metric and Greek or lower case indices for the Christoffel symbols of the 
other metric, e.g. [ij,k] and I, for the Christoffel symbols of the space metric and [a, 7] 
and [7 g tor the Christoffel symbols of the surface metric. However, the latter methods do 
not apply when the indices are numeric rather than symbolic. 

The number of independent Christoffel symbols of each kind (first and second) in general 
coordinate systems is given by: 
n?(n+1) 

2 


where n is the space dimension. The reason is that, due to the symmetry of the metric 
tensor there are nine) independent metric components, g;;, and for each independent 
component there are n distinct Christoffel symbols. Alternatively, we have n? permutations 
of the three indices including the repetitive ones, and out of these n? permutations we 
have (n(n — 1)n) permutations whose paired indices are different where the three factors 
correspond to the first, second and third index respectively. Now, due to the symmetry 
of the Christoffel symbols in their paired indices, half of these (n(n — 1)n) permutations 
are identical to the other half and hence they are not independent. Therefore, the total 


Not = (353) 
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number of independent Christoffel symbols is: 


(n(n—1)n) InP —nPt+n? nPt+n? — nv? (n+1) 


Na =n — = = = 4 
aw 2 2 2 2 en 
The partial derivative of the Christoffel symbol of the first kind is given by: 
sh 1 
Ox |t7, 4] = 3 (0,0; 9:1 + O,0:951 — O1,019%;) (355) 


where this equation is based on the definition of the Christoffel symbol of the first kind as 
given by Eq. 307. 

Finally, we should remark that there are several notations for the Christoffel symbols. 
As well as the symbols that we use in this book which may be the most common in 
use (i.e. [ij,k] for the first kind and I, for the second kind), the first kind may also 
be symbolized as I;;, or 7 while the second kind may be symbolized as Let or {i} 
as well as other notations. There may be some advantages or disadvantages in these 
different notations. For example, the notations [';;, and re may suggest, wrongly, that 
these symbols are tensors which is not the case since the Christoffel symbols are not 
general tensors although they are affine tensors. There may also be some advantages in 
the typesetting and writing of these notations or there are factors related to recognition, 
readability and even aesthetics. 


5.2 Covariant Differentiation 


The focus of this section is the operation of covariant differentiation of tensors which is a 
generalization of the ordinary partial differentiation. The ordinary derivative of a tensor is 
not a tensor in general. The objective of covariant differentiation is to ensure the invariance 
of derivative (i.e. being a tensor) in general coordinate systems, and this results in applying 
more sophisticated rules using Christoffel symbols where different differentiation rules for 
covariant and contravariant indices apply. The resulting covariant derivative is a tensor 
which is one rank higher than the differentiated tensor. In brief, the covariant derivative 
is a partial derivative of the tensor that includes differentiating the basis vectors as well as 
differentiating the components, as we will see. Hence, the covariant derivative of a general 
tensor can be given generically by: 


On (AGB; es Pn ae .-E”) = Ape | nS J Pe i (356) 
where the expression of Ae se will be given in the following paragraphs (see e.g. Eq. 


366). 

More explicitly, the basis vectors in general curvilinear coordinate systems undergo 
changes in magnitude and direction as they move around in their own space, and hence 
they are functions of position. These changes should be accounted for when calculating the 
derivatives of non-scalar tensors in such general systems. Therefore, terms based on using 
Christoffel symbols are added to the ordinary derivative terms to correct for these changes 
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and this more comprehensive form of derivative is called the covariant derivative. Since 
in rectilinear coordinate systems the basis vectors are constants, the Christoffel symbol 
terms vanish identically and hence the covariant derivative reduces to the ordinary partial 
derivative, but in the other coordinate systems these terms are present in general. As a 
consequence, the ordinary derivative of a non-scalar tensor is a tensor iff the coordinate 
transformations from Cartesian systems to that system are linear. 

It has been suggested that the “covariant” label is an indication that the covariant dif- 
ferentiation operator V.; is in the covariant position. However, it may also be true that 
“covariant” means “invariant” as the term “covariant” is also used in tensor calculus to 
mean “invariant”. In fact, “covariant” as opposite to “contravariant” applies even to the 
common form of ordinary partial differentiation since the commonly used partial differ- 
ential operator 0; is also in the covariant position. Anyway, contravariant differentiation, 
which may be associated with the operator V, can also be defined for covariant and 
contravariant tensors by raising the differentiation index of the covariant derivative using 
the index raising operator, e.g. 


AP Zig Ag. AP aga se (357) 


However, contravariant differentiation is rarely used. 

As an example of how to obtain the covariant derivative of a tensor, let have a vector 
A represented by contravariant components in general curvilinear coordinates, that is: 
A = A'E;. We differentiate this vector following the normal rules of differentiation and 
taking account of the fact that the basis vectors in general curvilinear coordinate systems 
are differentiable functions of position and hence they, unlike their rectilinear counterparts, 
are subject to differentiation using the product rule, that is: 


A.; =dO;A (definition) (358) 
=a, (4'B) 
= E,0,;A' + A'‘0;E; (product rule) 
= E,0;A' + ATSB, (Eq. 312) 
= E,0,A' + A‘T:, Bi (relabeling dummy indices i + k) 
= (0,A* + A'T;;) E; (taking common factor) 
= A’ jE; (notation) 


where A‘, which is a rank-2 mixed tensor, is labeled the “covariant derivative” of A’. 
Similarly, for a vector represented by covariant components in general curvilinear coor- 
dinate systems, A = A;E’, we have: 


A. =0;A (definition) (359) 
= 0; (AE 
= E'0;A; + A,0;E’ (product rule) 


= E'0;A; — ATi,;E" (Eq. 313) 


5.2 Covariant Differentiation 122 


= E'0;A; - A,T;E' (relabeling dummy indices i + k) 
= (O;A; — A,Ts,) E’ (taking common factor) 
= A;.;E' (notation) 


The same rules apply to tensors of higher ranks. For example, for a rank-2 mixed tensor, 
A = A,/E'E,, we have: 
An = OWA (definition) (360) 
= 0; (A,’E'E;) 
= (0,A,’) B'E; + A,’ (0,E') E; + A,’E' (0,E;) 
= (,A,’) E'E,; + A,’ (-T),E*) BE; + A,’E' (T9,E,) 
= (0,A;’) E'E; — A,’T!,E°B; + A,T%,E'E, 
= (0,4,/) FE; — AT? B'E; + A;°T!,E'E; 
a (A,A,’ = Asta A;T,) E'E; 
x A,’ ,E'E; 


product rule) 
Eqs. 312 & 313) 
Eq. 125) 


relabeling dummy indices) 


taking common factor) 


( 
( 
( 
( 
( 
( 


notation) 


Based on the above arguments and examples, the main rules of covariant differentiation 
can be outlined in the following examples. For a differentiable vector A, the covariant 
derivative of the covariant and contravariant forms of the vector is given by: 


Ayn OAs = T¥,Ag (covariant) (361) 
A}, = 0,47 +TLA* (contravariant) (362) 


Similarly, for a differentiable rank-2 tensor A, the covariant derivative of the covariant, 
contravariant and mixed forms of the tensor is given by: 


Arne OAc = TY Ate — TAs (covariant) (363) 
Ale = 0,A*+Ti Ae +e At (contravariant) (364) 
AR, =O,Ae + TRA, —Ty Al (mixed) (365) 


We note that for the mixed form there are two possibilities: one associated with the dyad 
E/E; and the other with the dyad E, EF’. 

Following the methods and techniques outlined in the previous examples, we can easily 
deduce the pattern of the operation of covariant differentiation. To obtain the covariant 
derivative of a tensor in general, we start with an ordinary partial derivative term of the 
component of the given tensor. Then for each tensor index an extra Christoffel symbol term 
is added, positive for contravariant indices and negative for covariant indices, where the 
differentiation index is one of the lower indices in the Christoffel symbol of the second kind. 
Hence, for a differentiable rank-n tensor A in general coordinate systems the covariant 
derivative with respect to the q’” coordinate is given by: 

AiieR 8 a, Aii-* 4 Te Ag 4 Ti, Ai-* bie esis al re Ai (366) 


lm...p3q lm...p lm...p Im...p 
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AA alee alae 

In fact, there is practically only one possibility for the arrangement and labeling of the 
indices in the Christoffel symbol terms of the covariant derivative of a tensor of any rank 
if the following rules are observed: 

1. The second subscript index of the Christoffel symbol is the differentiation index. 

2. The differentiated index of the tensor in the Christoffel symbol term is contracted 
with one of the indices of the Christoffel symbol using a new label and hence they 
are opposite in their covariant and contravariant type. 

3. The label of the differentiated index is transferred from the tensor to the Christoffel 
symbol keeping its position as covariant or contravariant. 

4. All the other indices of the tensor in the concerned Christoffel symbol term keep 
their labels, position and order. 

The ordinary partial derivative term in the covariant derivative expressions (see e.g. Eq. 
366) represents the rate of change of the tensor components with change of position as a 
result of moving along the coordinate curve of the differentiation index, while the Christof- 
fel symbol terms represent the change experienced by the local basis vectors as a result of 
the same movement. This can be seen from the development of Eqs. 358-360 where the 
indicated terms correspond to the components and basis vectors according to the product 
rule of differentiation. 

From the above discussion, it is obvious that to obtain the covariant derivative, the 
Christoffel symbols of the second kind should be obtained and these symbols are dependent 
on the metric tensor. Hence, the covariant derivative is dependent on having the space 
metric corresponding to the particular coordinate system. We also note that the covariant 
derivative of a tensor is a tensor whose covariant rank is higher than the covariant rank 
of the original tensor by one. Hence, the covariant derivative of a rank-n tensor of type 
(r,s) is a rank-(n + 1) tensor of type (r,s + 1). 

In all coordinate systems, the covariant derivative of a differentiable scalar function of 
position, f, is the same as the ordinary partial derivative, that is: 


fi = fi = Of (367) 


This is justified by the fact that the covariant derivative is different from the ordinary 
partial derivative because the basis vectors in general curvilinear coordinate systems are 
dependent on their spatial position, and since a scalar is independent of the basis vectors 
the covariant derivative and the partial derivative are identical. The derivation of the 
expressions of the covariant derivative of contravariant and covariant vectors and higher 
rank tensors, as given by Eqs. 358-360, clearly justifies this logic where the product rule 
does not apply due to the absence of a basis vector in the representation of a scalar. This 
can also be concluded from the covariant derivative rules as demonstrated in the previous 
statements and formulated in the above equations like Eq. 366 since a scalar has no free 
index and hence it cannot have any Christoffel symbol term. By a similar reasoning, since 
the Christoffel symbols are identically zero in rectilinear coordinate systems, the covariant 
derivative in these systems is the same as the ordinary partial derivative for all tensor 
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ranks, whether scalars or not. This can also be seen from the product rule (as employed in 
the development of Eqs. 358-360 for instance) where all the terms involving differentiation 
of basis vectors will vanish since these vectors are constant in rectilinear systems. 
Another important fact about covariant differentiation is that the covariant derivative of 
the metric tensor in its covariant, contravariant and mixed forms is zero in all coordinate 
systems and hence it is treated like a constant with respect to covariant differentiation. 
Accordingly, the covariant derivative operator bypasses the metric tensor, e.g. 


Ov, (gi; 4”) = G50 (368) 


and hence the metric tensor commutes with the covariant differential operator. We will 
expand on this issue later in this section. 

Several rules of ordinary differentiation are naturally extended to covariant differentia- 
tion. For example, covariant differentiation is a linear operation with respect to algebraic 
sums of tensor terms and hence the covariant derivative of a sum is the sum of the covariant 
derivatives of the terms, that is:!2?! 

(aA + bB)., = aA, + dB; (369) 
where a and 6 are scalar constants and A and B are differentiable tensors. The product 
rule of ordinary differentiation also applies to the covariant differentiation of inner and 
outer products of tensors, that is: 

(AoB),=A,oB+AoB, (370) 
where the symbol o denotes an inner or outer product operator. However, as seen in this 
equation, the order of the tensors should be observed since tensor multiplication, unlike 
ordinary algebraic multiplication, is not commutative (refer to § 3.2.3). The product 
rule is valid for the inner product of tensors because the inner product is an outer product 
operation followed by a contraction of indices, and covariant differentiation and contraction 
of indices do commute as we will see later. 

A principal difference between partial differentiation and covariant differentiation is that 
for successive differential operations with respect to different indices the ordinary partial 
derivative operators do commute with each other, assuming that the well known continuity 
condition is satisfied, but the covariant differential operators do not commute, that is: 


0,0; = 050; 0;;0.5 F 0.304 (371) 


This will be verified later in this section. 

As indicated earlier, according to the “Ricci theorem” the covariant derivative of the 
covariant, contravariant and mixed forms of the metric tensor is zero. This has nothing to 
do with the metric tensor being a constant function of coordinates, which is true only for 


[22] We use a semicolon with symbolic notation of tensors in this equation and other similar equations for 
the sake of clarity; the meaning should be obvious. 
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rectilinear coordinate systems, but this arises from the fact that the covariant derivative 
quantifies the change with position of the basis vectors in magnitude and direction as well 
as the change in components, and these contributions in the case of the metric tensor 
cancel each other resulting in a total null effect. For example, for the covariant form of 
the metric tensor we have: 


Si = Ox (gijE"E’) (definition) (372) 
= (On9i;) E’E? + ij (0,E*) E/ + 9ijE" (0,E) (product rule) 
= ([tk, j] + [jk, i]) EXE? + gi; (—j,E') E’ + gj’ (-Tj,E') (Eqs. 317 & 313) 


= [ik, j] EXE’ + [jk, 7] EXE! — g,,11, EE — 9,1, EE! 

= [ik, j] E’E? + [jk, i] E'E? — (Ik, 9] E'E’ — [lk, i] E'E! (Eq. 309) 

= [ik, j] E'E’ + [jk, i] EE? — [ik, j] E’E’ — [jk, 7] E'E? (relabeling dummy J) 
=0 


Similarly, for the contravariant form of the metric tensor we have: 

Sx = On (9 E.E;) (definition) (373) 
= (O.9") EXE; + 9” (0,E;) E; + gE; (0.E;) (product rule) 
= (-9¥T%, — 9°°T2,,) EXE; + 9” (T4,E.) Ej + gE; (0%,B.) (Eqs. 322 & 312) 
= —g@T,,E;E; — 9°12, E,B; + 9T%E,B; + 90%, BiE, 
= —9¥T',E,E, — oT, E,E,; + 9°1', B,B; + 91", E,E; (relabeling indices) 
=0 


As for the mixed form of the metric tensor we have: 


Lin = Op (5',E\E’) (definition) (374) 
= (O,5";) E,;E? + 5, (0,E,) E’ + OE; (0, E’) (product rule) 
= 0+ 6, ((Z,E,) E’ + 5B; (-I%,,E*) (Eqs. 312 & 313) 
= 6'.T4.E,E! — 61,07, E,E* 
= [¢,E.b! — 1),E,E* (Eq. 172) 
= I, .E,E/ — I. .E,E/ (relabeling dummy indices) 
=0 


As a result of the Ricci theorem, the metric tensor behaves as a constant with respect 
to the covariant derivative operation, that is: 


g. =0 (375) 


where g is the metric tensor in its covariant or contravariant or mixed form, as seen above. 
Consequently, the covariant derivative operator bypasses the metric tensor, that is: 


(go A), =goAy (376) 
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where A is a general tensor and the symbol o denotes an inner or outer tensor product.3! 


The commutativity of the covariant derivative operator and the metric tensor acting as an 
index shifting operator may be demonstrated more vividly by using the indicial notation, 
e.g. 
Gik (A*,) = Gin A®; = Ai; = (Ai); = (ginA®)., (377) 
GmiGng (A) = Gmigng A, = Auge = (Ais) 4 = (Gmignj A”) 4 (378) 
where we see that the sequence of these operations, seen in one order from the right and 
in another order from the left, has no effect on the final result in the middle. 
Like the metric tensor, the ordinary Kronecker delta tensor is constant with regard 
to covariant differentiation and hence the covariant derivative of the Kronecker delta is 
identically zero, that is:!?41 


he = Ono; + O°, — OT, (Eq. 365) (379) 
=0+ 65 1-6 Gk (5; is constant) 
SCE =i, (Eq. 172) 
=0 


Accordingly, the covariant differential operator bypasses the Kronecker delta tensor which 
is involved in inner and outer tensor products:!?5| 


(60A), =60Ay, (380) 


The rule of the Kronecker delta may be regarded as an instance of the rule of the metric 
tensor, as stated by the Ricci theorem, since the Kronecker delta is a metric tensor for 
certain systems and types. Like the ordinary Kronecker delta, the covariant derivative of 
the generalized Kronecker delta is also identically zero. This may be deduced from Eq. 379 
plus Eq. 199 where the generalized Kronecker delta is given as a determinant consisting of 
ordinary Kronecker deltas and hence it is a sum of products of ordinary Kronecker deltas 
whose partial derivative vanishes because each term in the derivative contains a derivative 
of an ordinary Kronecker delta. 

Based on the previous statements, we conclude that covariant differentiation and con- 
traction of index operations commute with each other, e.g. 


(Agi) 8) = Ards = Aisi = (Ai) a = (A855), om) 


where we see again that the different sequences from the right and from the left produce 
the same result in the middle. For clarity, we represented the contraction operation in this 


3] Although the metric tensor is normally used in inner product operations for raising and lowering 
indices, the possibility of its involvement in outer product operations should not be ruled out. 

24] For diversity, we use indicial notation rather than symbolic notation which we used for example in 
verifying g., = 0. 

[25] Like the metric tensor, the Kronecker delta tensor is normally used in inner product operations for 
replacement of indices; however the possibility of its involvement in outer product operations should 
not be ruled out. 
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example by an inner product operation of the ordinary Kronecker delta tensor with the 
contracted tensor. 

For a differentiable function f (x, y) of class C? (i.e. all the second order partial derivatives 
of the function do exist and are continuous), the mixed partial derivatives are equal, that 
is: 

0-0 yt = Og0rt (382) 
However, even if the components of a tensor satisfy this condition (i.e. being of class C”), 
this is not sufficient for the equality of the mixed covariant derivatives. What is required 
for the mixed covariant derivatives to be equal is the vanishing of the Riemann-Christoffel 
curvature tensor (see § 7.2.1) which is equivalent to having an intrinsically flat space. This 
will be verified later in this section. 

Higher order covariant derivatives are defined as derivatives of derivatives by successive 
repetition of the process of covariant differentiation. However, the order of differentiation, 
in the case of differentiating with respect to different indices, should be respected as stated 
in the previous statements. For example, the second order mixed jk covariant derivative 
of a contravariant vector A is given by: 


A, = (Aij) (383) 
OAL, +13, At, —T4, At, 
= 0 (0,A'+T%,A°) +1, (4° + T$A’) — 14, (6.4% + 15,4”) 
8,0;A* +14 ,0,A° + A°O,1%, + %4,0;A° + 14,09, 4° — 19,0, A* — 12.04, A° 


that is: 
At, = O,0;A' + 14,0, A° — 14,0, A' + 14,0;A° + A* (O03; —TE0 Go +15,03;) (384) 


The second order mixed kj covariant derivative of a contravariant vector can be derived 
similarly. However, it can be obtained more easily from the last equation by interchanging 
the j and & indices, that is: 


A’; = OjOpA® + 14,0jA° — TEjO.A* + 5 ,O¢A% + A® (Oo —PRVia + TG 8x) (885) 


The inequality of the jk and kj mixed derivatives in general can be verified by subtracting 
the two sides of the last two equations from each other where the right hand side will not 
vanish. 

Similarly, the second order mixed 7k covariant derivative of a covariant vector A is given 


by: 


Aijk = (Ais). (386) 
OpAi3 — TH, Aag — Gp Asa 


On (0;A; — Ty,As) = | ips (0; Aa 7 re Av) — Be (0.4; = Ty, Ab) 
= d,0;A; —TYjOrAn — AvOgl}; — 19,0; Aa + 19,03; As — 14,004: + 73,03, A0 


1a 
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that is: 
Aigr = O40;Ai — TZ0¢Aa — 14,0; Aa — 19,024; — Aa (OT %, mee bi Bi ri Ss) (387) 


The second order mixed kj covariant derivative of a covariant vector can be obtained from 
the last equation by interchanging the 7 and k indices, that is: 


Aiznj = Oj0n Ai — TG,0; Aa — UE O¢Aa — U4 ;0aAi — Aa (OTH, — 3TH, —TR,T%) (388) 


Again, the inequality of the jk and kj mixed derivatives can be verified by subtracting 
the two sides of the last two equations from each other where the right hand side will not 
vanish. 

We will see in § 7.2.1 that the mixed second order covariant derivatives of a covariant vec- 
tor A; are linked through the Riemann-Christoffel curvature tensor R%;, by the following 
relation: 

Aizjgk — Aik = AaR yx (389) 
This relation can be verified by subtracting the two sides of Eq. 388 from the two sides of 
Eq. 387 and employing the definition of the Riemann-Christoffel curvature tensor of Eq. 
560 to the right hand side, that is: 


j 
Ag (OP %, = iy bk <= ne ‘b) — A, (AT Y, = es bi = r’.%) 
Aa (00% — Ly ame ive ib — OD; + Te. Pia m4) 

= A, (OP = Ty bk ee oa ote Ty by) 

=) Ak 


ijk 


Aik > Aiisiej = —Ag (AT, i ie bi = 4) + Ag (OT %, ~ ie bk ~ Le ) (390) 


where the last step is based on Eq. 560. 

The covariant derivatives of relative tensors, which are also relative tensors of the same 
weight as the original tensors, are obtained by adding a weight term to the normal formulae 
of covariant derivative. Hence, the covariant derivative of a relative scalar with weight w 
is given by: . 

fa = fa—w fT, (391) 
while the covariant derivative of relative tensors of higher ranks with weight w is obtained 
by adding the following term to the right hand side of Eq. 366: 

— wae Ta, (392) 

Unlike ordinary differentiation, the covariant derivative of a non-scalar tensor with con- 
stant components is not zero in general curvilinear coordinate systems due to the presence 
of the Christoffel symbols in the definition of the covariant derivative, as given by Eq. 366. 
More explicitly, even though the partial derivative term is zero because the components 
are constant, the Christoffel symbol terms are not zero in general because the basis vectors 
are variables (refer to the derivation in Eqs. 358-360). 
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In rectilinear coordinate systems, the Christoffel symbols are identically zero because the 
basis vectors are constants, and hence the covariant derivative is the same as the ordinary 
partial derivative for all tensor ranks. As a result, when the components of the metric 
tensor g;; are constants, as it is the case for example in rectangular coordinate systems, 
the covariant derivative becomes ordinary partial derivative. The constancy of the metric 
tensor in this case can be concluded from the definition of the components of the metric 
tensor as dot products of the basis vectors, as seen for example in Eqs. 213 and 214 (also 
see Table 1). 

We remark that for a differentiable covariant vector A which is a gradient of a scalar 
field we have: 

Ais = Aj; (393) 


This can be easily verified by defining A as: A; = f; where f is a scalar and hence we 
have: 


Ais = (fy).j (394) 
= (fa), -—Ti fa (Eq. 361) 
= fig - Wee 
= fyi —V5if (Eqs. 382 & 311) 
= (f3).4 (Eq. 361) 
= Aj; 


Another important remark is that the covariant derivative of the basis vectors of the 
covariant and contravariant types is identically zero, that is: 


E,,, = 0;E; —T%¥E, = +1,E, —T7jE, = 0 (395) 
E) = 0)B' +1) j,B* = —PjjB* +TjjB* = 0 (396) 


where Eqs. 361 and 362 are used in the first steps, while Eqs. 312 and 313 are used in the 
second steps. 


5.3 Absolute Differentiation 


The absolute derivative of a tensor along a t-parameterized curve C(t) in an nD space 
with respect to the parameter ¢ is the inner product of the covariant derivative of the 
tensor and the tangent vector to the curve. In brief, the absolute derivative is a covariant 
derivative of a tensor along a curve. For a tensor A’, the inner product of At, which is 
a tensor, with another tensor is a tensor. Now, if the other tensor is a which is the 
tangent vector to a t-parameterized curve C(t) given by the equations u’ = u'(t) where 


i =1,---,n, then the inner product: 


as. (397) 
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is a tensor of the same rank and type as the tensor A’. The tensor given by the expression 
of Eq. 397 is called the “absolute” or “intrinsic” or “absolute covariant” derivative of the 


tensor A’ along the curve C and is symbolized by ae that is: 


dA* body” 
ot a dt 98) 
In fact, instead of basing the definition of absolute differentiation on the definition of the 
covariant differentiation as a dot product of the covariant derivative with the tangent to 
the curve as seen above, we can define absolute differentiation independently by following 
the same method that we used to obtain the covariant derivative through applying the 
differentiation process on the basis vectors as well as the components of the tensor (as seen 
in the derivation of Eqs. 358-360) and hence the expressions of the absolute derivative 
(e.g. the upcoming Eqs. 401 and 402) can be obtained directly by applying total differ- 
entiation to the tensor including its basis vectors. For example, the absolute derivative of 
a contravariant vector A = A’E, can be obtained as follows: 


dA d 
a ae 399 
dt dt (4'°E:) (399) 
At dE; 
= 5; “ mie di. (product rule) 
= E; dt Shai (chain rule) 
dA’ . aus 
=E,7 + ATSERG Eq. 312 
dt ape as (Eq. 312) 
At Bd 
os + BA’ ie (exchanging dummy indices 7, k) 
dA’ oa 
= ATE 
( deat ) : 
~ bt B; (definition of intrinsic derivative) 


which is the same as the upcoming Eq. 401. 
Following the above definitions, the absolute derivative of a differentiable scalar f is the 
same as the ordinary total derivative, that is: 
of = a (400) 
ot — dt 
This is because the covariant derivative of a scalar is the same as the ordinary partial 
derivative or because a scalar has no association with basis vectors to differentiate. The 
absolute derivative of a differentiable contravariant vector A’ with respect to the parameter 
t is given by: 
dA" At 5 age 


ee ale ee A4O1 
Ot dt} dt ou 
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Similarly, for a differentiable covariant vector A; we have: 


ee (402) 
ot dt Be dk 

Absolute differentiation can be easily extended to higher rank (> 1) differentiable tensors 
of type (m,n) along parameterized curves following the given examples and the pattern of 
covariant derivative. For instance, the absolute derivative of a mixed tensor of type (1, 2) 
A‘, along a t-parameterized curve C’' is given by: 


dA‘, ; du? dA‘, ‘ du b du b ’ du b 
5 A Gy = apt Paw Ae Ge — Piva ge — Tie Aia Ge oe) 
Since the absolute derivative is given generically by:?4 
OA du* 
ca a rs (404) 


it can be seen as an instance of the chain rule of differentiation where the two contracted 
indices represent the in-between coordinate differential. This can also be concluded from 
the above method of derivation of Eq. 399. Because the absolute derivative along a curve 
is just an inner product of the covariant derivative with the tangent vector to the curve, the 
well known rules of ordinary differentiation of sums and products also apply to absolute 
differentiation, as for covariant differentiation, that is: 


“) oA 0B 
) dA 0B 


where a and b are constant scalars, A and B are differentiable tensors and the symbol 
o denotes an inner or outer product of tensors. However, the order of the tensors in the 
products should be observed since tensor multiplication is not commutative. 

Because absolute differentiation follows the style of covariant differentiation, the metric 
tensor in its different variance types is in lieu of a constant with respect to absolute 
differentiation, that is: 


094; gi 


ot ot ue 
and hence it passes through the absolute derivative operator, that is: 
5 (gis A’) 5A! (9% Aj) _ 4 Ay 
= g;;— = q' 408 
bt ar, a 


26] We use the absolute derivative notation with the symbolic notation of tensors in this equation and some 
of the upcoming equations to ease the notation. This is similar to the use of the covariant derivative 
notation with the symbolic notation as seen here and in previous equations. The meaning of these 
notations should be clear 
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For coordinate systems in which all the components of the metric tensor are constants, 
the absolute derivative is the same as the ordinary total derivative, as it is the case in 
rectilinear coordinate systems, because the Christoffel symbol terms are zero in these 
systems according to Eq. 308. The absolute derivative of a tensor along a given curve is 
unique, and hence the ordinary derivative of the tensor along that curve in a rectangular 
coordinate system is the same as the absolute derivative of the tensor along that curve in 
any other system although the forms in the two systems may be different. 


To sum up, we list in the following bullet points the main rules of covariant and absolute 
differentiation: 

1. Tensor differentiation (represented by covariant and absolute differentiation) is the 
same as ordinary differentiation (represented by partial and total differentiation) but 
with the application of the differentiation process not only on the tensor components 
but also on the basis vectors that associate these components. 

2. The sum and product rules of differentiation apply to covariant and absolute differ- 
entiation as for ordinary differentiation (i.e. partial and total). 

3. The covariant and absolute derivatives of tensors are tensors. 

4. The covariant and absolute derivatives of scalars and affine tensors of higher ranks 
are the same as the ordinary derivatives. 

5. The covariant and absolute derivative operators commute with the contraction of 
indices. 

6. The covariant and absolute derivatives of the metric, Kronecker and permutation 
tensors as well as the basis vectors vanish identically in any coordinate system, that 
is: 


Gijlq = 9 Gan, se) (409) 
i _ i) 
jlq 0 Ontlg oh (410) 
Eijklq = O ey = 0 (411) 
Ei, = 0 Ei, = 0 (412) 


where the sign | represents covariant or absolute differentiation with respect to the 
space coordinate wu’. Hence, these tensors should be treated like constants in tensor 
differentiation. This applies to the covariant and contravariant forms of these tensors, 
as well as the mixed form when it is applicable. It also applies to the generalized, as 
well as the ordinary, Kronecker delta as seen in the above equations. 

7. Covariant differentiation increases the covariant rank of the differentiated tensor by 
one, while absolute differentiation does not change the rank or type of the differen- 
tiated tensor. 

8. The covariant and absolute derivatives in rectilinear systems are the same as the 
partial and total derivatives respectively for all tensor ranks. 
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5.4 Exercises and Revision 


5.1 


5.2 


5.3 


5.4 


5.9 


5.6 


5.7 


5.8 


5.9 


5.10 


5.11 


ool? 


5.13 


5.14 


5.15 


5.16 


Onde 


Why tensor differentiation (represented by covariant and absolute derivatives) is 
needed in general coordinate systems to replace the ordinary differentiation (rep- 
resented by partial and total derivatives)? 

Show that in general coordinate systems, the ordinary differentiation of the compo- 
nents of non-scalar tensors with respect to the coordinates will not produce a tensor 
in general. 

“The Christoffel symbols are affine tensors but not tensors”. Explain and justify this 
statement. 

What is the difference between the first and second kinds of the Christoffel symbols? 
Show that the Christoffel symbols of both kinds are not general tensors by giving 
examples of these symbols being vanishing in some systems but not in other systems 
and considering the universality of the zero tensor (see § 3.1.4). 

State the mathematical definitions of the Christoffel symbols of the first and second 
kinds. How these two kinds are transformed from each other? 

What is the significance of the Christoffel symbols being solely dependent on the 
coefficients of the metric tensor in their relation to the underlying space and coordinate 
system? 

Do the Christoffel symbols represent a property of the space, a property of the coor- 
dinate system, or a property of both? 

If some of the Christoffel symbols vanish in a particular curvilinear coordinate system, 
should these some necessarily vanish in other curvilinear coordinate systems? Justify 
your answer by giving some examples. 

Verify that the Christoffel symbols of the first and second kind are symmetric in their 
paired indices by using their mathematical definitions. 

Correct, if necessary, the following equations: 


0,E; = -T,E, 0;E* = —T%,,E” 
What is the significance of the following equations? 
E* .0;B; = Tk. E, -0;E’ = Ty, E;, - 0;E; = [ij, k] 
Derive the following relations giving full explanation of each step: 
O;gu = [29,0 + [17,7 IY, = 0; (In V9) 


Assuming an orthogonal coordinate system, verify the following relation: [ij,k] = 0 
wherei AJ F#K. 

Assuming an orthogonal coordinate system, verify the following relation: M, = 
50; Ing, with no sum over 7. 

Considering the identicality and difference of the indices of the Christoffel symbols of 
either kind, how many cases we have? List these cases. 

Prove the following relation which is used in Eq. 325: 0,9 = gg" Oigj1- 


5.4 Exercises and Revision 134 


5.18 


5.19 


5.20 


5.21 


5.22 


5.23 


5.24 


O20 


5.26 


5.27 


5.28 


5.29 


5.30 


5.31 
5.32 


5.33 


5.34 


In orthogonal coordinate systems of a 3D space the number of independent non- 
identically vanishing Christoffel symbols of either kind is only 15. Explain why. 
Verify the following equations related to the Christoffel symbols in orthogonal coor- 
dinate systems in a 3D space: 


(12, 1] = hihi Be = — 


Justify the following statement: “In any coordinate system, all the Christoffel symbols 
of either kind vanish identically zff all the components of the metric tensor in the given 
coordinate system are constants”. 

Using Eq. 307 with Eq. 239, find the Christoffel symbols of the first kind correspond- 
ing to the Euclidean metric of cylindrical coordinate systems. 

Give all the Christoffel symbols of the first and second kind of the following coordinate 
systems: orthonormal Cartesian, cylindrical and spherical. 

Mention two important properties of the Christoffel symbols of either kind with regard 
to the order and similarity of their indices. 

Using the entries in Eq. 336 and Table 1 and the properties of the Christoffel symbols 
of the second kind, derive these symbols corresponding to the metrics of the coordinate 
systems of question 5.22. 

Write the following Christoffel symbols in terms of the coordinates instead of the 
indices assuming a cylindrical system: [12,1], [23,1], [3, and T'3,. Do the same 
assuming a spherical system. 

Show that all the Christoffel symbols will vanish when the components of the metric 
tensor are constants. 

Why the Christoffel symbols of either kind may be superscripted or subscripted by the 
symbol of the underlying metric tensor? When this (or other measures for indicating 
the underlying metric tensor) becomes necessary? Mention some of the other measures 
used to indicate the underlying metric tensor. 

Explain why the total number of independent Christoffel symbols of each kind is equal 
to SS. zt) 

Why covariant differentiation of tensors is regarded as a generalization of the ordinary 
partial differentiation? 

In general curvilinear coordinate systems, the variation of the basis vectors should 
also be considered in the differentiation process of non-scalar tensors. Why? 

State the mathematical definition of contravariant differentiation of a tensor Aj. 
Obtain analytical expressions for A;.; and Bi, by differentiating the vectors A = A;E* 
and B = B’'E;. 

Repeat question 5.32 with the rank-2 tensors C = C;;E'E’ and D = DYE;E; to 
obtain Cj;., and D ae 

For a differentiable nee A of type (m,n), the covariant derivative with respect to 
the coordinate u* is given by: 


111Q-..4m 


Aitt2- lm = OA jig. +P 1 Alia. tm + P2 Aut. ‘lm ities Sea él 


Jij2---Inik Ou Jija-Jn J1d2--Jn Fij2-Jn 
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5.35 


5.36 


5.37 


5.38 


5.39 


5.40 


5.41 


5.42 


5.43 


5.44 


5.45 


5.46 


5.47 


5.48 


5.49 
5.50 


5.01 


5.52 
5.93 


SPARS Anne one Tt gabe 
Extract from the pattern of this expression the practical rules that should be followed 
in writing the analytical expressions of covariant derivative of tensors of any rank and 
type. 
In the expression of covariant derivative, what the partial derivative term stands for 
and what the Christoffel symbol terms represent? 
For the covariant derivative of a type (m,n,w) tensor, obtain the number of total 
terms, the number of negative Christoffel symbol terms and the number of positive 
Christoffel symbol terms. 
What is the rank and type of the covariant derivative of a tensor of rank-n and type 
(p,q)? 
The covariant derivative of a differentiable scalar function is the same as the ordinary 
partial derivative. Why? 
What is the significance of the dependence of the covariant derivative on the Christoffel 
symbols with regard to its relation to the space and coordinate system? 
The covariant derivative of tensors in coordinate systems with constant basis vectors 
is the same as the ordinary partial derivative for all tensor ranks. Why? 
Express, mathematically, the fact that the metric tensor is in lieu of constant with 
respect to covariant differentiation. 
Which rules of ordinary partial differentiation also apply to covariant differentiation 
and which rules do not? State all these rules symbolically for both ordinary and 
covariant differentiation. 
Explain why the covariant differential operators with respect to different indices do 
not commute, i.e. 0.;0.; 4 0.;0; (i # J). 
State the Ricci theorem about covariant differentiation of the metric tensor and prove 
it with full justification of each step. 
State, symbolically, the commutative property of the covariant derivative operator 
with the index shifting operator (which is based on the Ricci theorem) using the 
symbolic notation one time and the indicial notation another. 
Verify that the ordinary Kronecker delta tensor is constant with respect to covariant 
differentiation. 
State, symbolically, the fact that covariant differentiation and contraction of index 
operations commute with each other. 
What is the condition on the components of the metric tensor that makes the covariant 
derivative become ordinary partial derivative for all tensor ranks? 
Prove that covariant differentiation and contraction of indices commute. 
What is the mathematical condition that is required if the mixed second order partial 
derivatives should be equal, i.e. 0,0; = 0;0; (i 3)? 
What is the mathematical condition that is required if the mixed second order covari- 
ant derivatives should be equal, i.e. 0.;0.; = 0.;0, (i # J)? 
Derive analytical expressions for Aj;.;, and Aj,,; and hence verify that Aj.;, A Ai.4;. 
From the result of exercise 5.52 plus Eq. 560, verify the following relation: Aj;.;, — 


5.4 Exercises and Revision 136 


5.504 


5.99 


5.96 


5.07 


5.98 


5.99 


5.60 


5.61 


5.62 


5.63 


5.64 


5.65 


5.66 


5.67 


5.68 
5.69 


Aizng = Aa Rj: 

What is the covariant derivative of a relative scalar f of weight w? What is the 
covariant derivative of a rank-2 relative tensor At of weight w? 

Why the covariant derivative of a non-scalar tensor with constant components is not 
necessarily zero in general coordinate systems? Which term of the covariant derivative 
of such a tensor will vanish? 

Show that: A;.; = Aj.; where A is a gradient of a scalar field. 

Show that the covariant derivative of the basis vectors of the covariant and contravari- 
ant types is identically zero, i.e. E;,; = 0 and Ei, = 0. 

Prove the following identity: 


On (gi; A’ B’) = Aj,B' + A‘'Bix 


Define absolute differentiation descriptively and mathematically. What are the other 
names of absolute derivative? 

Write the mathematical expression for the absolute derivative of the tensor field AY, 
which is defined over a space curve C(t). 


Why the absolute derivative of a differentiable scalar is the same as its ordinary total 
derivative, i.e. of = a? 


Why the absolute derivative of a differentiable non-scalar tensor is the same as its 
ordinary total derivative in rectilinear coordinate systems? 

From the pattern of covariant derivative of a general tensor, obtain the pattern of its 
absolute derivative. 

We have A = A‘. E,E,;E*. Apply the ordinary total differentiation process (i.e. “) 
onto this tensor (including its basis vectors) to obtain its absolute derivative. 

Which rules of ordinary total differentiation also apply to intrinsic differentiation and 
which rules do not? State all these rules symbolically for both ordinary and intrinsic 
differentiation. 

Using your knowledge about covariant differentiation and the fact that absolute differ- 
entiation follows the style of covariant differentiation, obtain all the rules of absolute 
differentiation of the metric tensor, the Kronecker delta tensor and the index shifting 
and index replacement operators. Express all these rules in words and in symbols. 
Justify the following statement: “For coordinate systems in which all the components 
of the metric tensor are constants, the absolute derivative is the same as the ordinary 
total derivative”. 

The absolute derivative of a tensor along a given curve is unique. What this means? 
Summarize all the main properties and rules that govern covariant and absolute dif- 
ferentiation. 


Chapter 6 
Differential Operations 


In this chapter, we examine the main differential operations which are based on the nabla 
operator V as defined in tensor calculus using largely tensor notation. These operations 
are based on the various types of interaction between the vector differential operator 
nabla and tensors of different ranks where some of these interactions involve the dot 
and cross product operations. The chapter will investigate these operations in general 
coordinate systems and in general orthogonal coordinate systems which are a special case 
of the general coordinate systems. The chapter will also investigate these operations in 
Cartesian coordinate systems as well as the two most important and widely used curvilinear 
orthogonal coordinate systems, namely the cylindrical and spherical systems, because of 
their particular importance and widespread application in science and engineering. Due 
to the likely familiarity of the reader with these operations in Cartesian systems, which 
is usually acquired at this level from a previous course on vector calculus, we start our 
investigation from the Cartesian system. This may help to remind the reader and make 
the subsequent sections easier to understand. 

Regarding the cylindrical and spherical coordinate systems, we can use indexed general 
coordinates like u', u? and u® to represent the cylindrical coordinates (p,¢,z) and the 
spherical coordinates (r, 6, @) and hence we can express these operations in tensor notation 
as we do for the other systems. However, for the sake of clarity and to follow the more 
conventional practice, we use the coordinates of these systems as suffixes in place of the 
usual indices used in the tensor notation. In this context, we should insist that these 
suffixes are labels and not indices and therefore they do not follow the rules and conventions 
of tensor indices such as following the summation convention. In fact, there is another 
reason for the use of suffix labels instead of symbolic indices that is the components in 
the cylindrical and spherical coordinates are physical, not covariant or contravariant, and 
hence suffixing with coordinates is more appropriate (see § 3.3). 

Before we start this investigation, we should remark that the differentiation of a tensor 
increases its rank by one, by introducing an extra covariant index, unless it implies a 
contraction in which case it reduces the rank by one. Therefore the gradient of a scalar is 
a vector and the gradient of a vector is a rank-2 tensor, while the divergence of a vector is 
a scalar and the divergence of a rank-2 tensor is a vector. This may be justified by the fact 
that the gradient operator is a vector operator. On the other hand the Laplacian operator 
does not change the rank since it is a scalar operator; hence the Laplacian of a scalar is a 
scalar and the Laplacian of a vector is a vector and so on. 

We should also remark that there are other nabla based operators and operations which 
are subsidiary to the main ones and are used in pure and applied mathematics and science. 
For example, the following scalar differential operator, defined in Cartesian coordinates, 


137 
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is commonly used in science such as fluid dynamics: 


0 
V=AV ag = a (413) 


where A is a vector. As explained earlier (see § 1.2), the order of A; and 0; should be 
respected. Another example is the following vector differential operator which also has 
common applications in science: 


[A x V], = €ij¢AjOx (414) 


where, again, the order should be respected and the operator is defined in Cartesian 
coordinates. 


6.1 Cartesian Coordinate System 


6.1.1 Operators 


The nabla vector operator V is a spatial partial differential operator which is defined in 
Cartesian coordinate systems by: 


a 
Vea (415) 


Similarly, the Laplacian scalar operator is given by: 


oO? 0? 


2 — hoe — 
Gee oi Ox,O02 ; Ox;O2; 


= Vii = Oi (416) 


We note that the Laplacian operator may also be notated with A (as well as several other 
symbols such as 02). However, in the present book we do not use this notation for this 
purpose. 


6.1.2 Gradient 


Based on the above definition of nabla, the gradient of a differentiable scalar function of 
position f is a vector obtained by applying the nabla operator to f and hence it is defined 


by: 5 
[Vi]; = Val On Of = fi (417) 


Similarly, the gradient of a differentiable vector field A is the outer product (refer to § 
3.2.3) between the nabla operator and the vector and hence it is a rank-2 tensor, that is: 


[VA],; = GA; (418) 


This definition can be easily extended to higher rank tensors. 
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6.1.3 Divergence 


The divergence of a differentiable vector A is the dot product of the nabla operator and 
the vector A and hence it is a scalar given by: 


VY hed, = SV A =0A,2 43 (419) 
si 


The divergence operation can also be viewed as taking the gradient of the vector followed 
by a contraction. Hence, the divergence of a vector is invariant because it is the trace of 
a rank-2 tensor (see § 7.1.2). It may also be argued more simply that the divergence of a 
vector is a scalar and hence it is invariant. 
Similarly, the divergence of a differentiable rank-2 tensor A is a vector defined in one of 
its forms by: 
[V - A]; = Aji (420) 


and in another form by: 


[V- A], = OA; (421) 


These two forms may be given respectively, using the symbolic notation, by: 
V-A and V-At (422) 


where A” is the transpose of A. More generally, the divergence of a tensor of rank n > 2, 
which is a tensor of rank-(n — 1), can be defined in several forms, which are different in 
general, depending on the choice of the contracted index. 


6.1.4 Curl 


The curl of a differentiable vector A is the cross product of the nabla operator and the 
vector A and hence it is a vector defined by: 


OA 
[V x Al, = ci = CHEV FAL = €:jhOj Ak = CpAkg (423) 
j 
The curl operation may be generalized to tensors of rank > 1 (see for example § 7.1.4), 
and hence the curl of a differentiable rank-2 tensor A can be defined as a rank-2 tensor 
given by: 
[V x Al;, = Cnn Ans (424) 


The last example can be easily extended to higher rank tensors. We note that there is 
more than one possibility for the contraction of the last index of the permutation tensor 
with one of the tensor indices when the rank is > 1. 
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6.1.5 Laplacian 
The Laplacian of a differentiable scalar f is given by: 


etal ne Od 
a Ox,0%; 7 Ox,OX; 


Vf =6 Vif = Ouf = fi (425) 


The Laplacian of a differentiable vector A is defined for each component of the vector in 
a similar manner to the definition of the Laplacian acting on a scalar, that is: 


[V?A], = V? [A], = 0);4; (426) 


a 


This definition can be easily extended to higher rank tensors. 


6.2 General Coordinate System 


Here, we investigate the differential operators and operations in general coordinate sys- 
tems. We note that the definitions of the differential operations in Cartesian systems, as 
given in the vector calculus texts and as outlined in § 6.1, are essentially valid in general 
non-Cartesian coordinate systems if the operations are extended to include the basis vec- 
tors as well as the components, as we will see in the following subsections. We also note 
that the analytical expressions of the differential operations can be obtained directly if the 
expression for the nabla operator V and the spatial derivatives of the basis vectors of the 
given coordinate system are known. 


6.2.1 Operators 


The nabla operator V, which is a spatial partial differential vector operator, is defined in 
general coordinate systems by: 


V =E'0; (427) 
Similarly, the Laplacian operator is defined generically by: 
V? =diverad =V-V (428) 


More details about these operators will follow in the next subsections. 


6.2.2 Gradient 


Based on the definition of the nabla operator, as given in the previous subsection, the 
gradient of a differentiable scalar function of position, f, is given by: 


The components of this expression represent the covariant form of a rank-1 tensor, i.e. 
[Vf]; = fi, as it should be since the gradient operation increases the covariant rank of a 
tensor by one, as indicated earlier. Since this expression consists of a contravariant basis 
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vector and a covariant component, the gradient in general coordinate systems is invariant 
under admissible coordinate transformations. 
The gradient of a differentiable covariant vector A can similarly be defined as follows: 


[27] 


= E’E/ O,A;+ E' A,0;E! (product rule) 

= B'E!0;A; + E'A; (-T,E*) (Eq. 313) 

= F'E!0;A; — E'BTs, Ag (relabeling dummy indices 7 & k) 
= E'E! (0,4; — T'* Ax) (taking common factor E'E’) 

= B'E’A,,; (definition of covariant derivative) 


and hence it is the covariant derivative of the vector. Similarly, for a differentiable con- 
travariant vector A the gradient is given by: 


VA = E’9; (A’E;) (431) 


= E'E,0,A! +E’ A) O;E; (product rule) 

= E'E,0,A’ + E' A’ (T%.Ex) (Eq. 312) 

= F'E,0;A’ + E'E,IY,A* (relabeling dummy indices j & k) 
= E’E; (0,4? + y,A*) (taking common factor E'E;) 

= E'B; A’, (definition of covariant derivative) 


The components of the gradients of covariant and contravariant vectors represent, respec- 
tively, the covariant and mixed forms of a rank-2 tensor, as they should be since the 
gradient operation increases the covariant rank of a tensor by one. 

The gradient of higher rank tensors is similarly defined. For example, the gradient of a 
rank-2 tensor in its covariant, contravariant and mixed form is given by: 


VA = E'BVES (0;Ajx —Ti Am —Ty; Aj) = E'EVE* Aj: = (covariant) —((482) 
VA = E'E;E, (0,A"* Ty Aes 1,4”) = E'E,;E,A?* (contravariant) (433) 
VA = EWE, (0,4/ —T A)" +T%4;) = EEE, AS, (mixed) (434) 
VA = E'B,E* (0,4), + 17,4), —T 47) = EE,E*A’,,; (mixed) (435) 


We finally remark that the contravariant form of the gradient operation can be obtained 
by using the index raising operator. For example, the contravariant form of the gradient 
of a scalar f is given by: 


[Vi] = Of = 9 Of = 9% f5 =f" (436) 


271 We note that the basis vector E’ which associates the derivative operator in the following equations 
(as well as in similar equations and expressions) should be the last one in the basis tensor so that the 
order of the indices in the components and in the basis tensor are the same. So, strictly we should 
write VA = 0; (A;E/) E’ where 0; acts only on what is inside the parentheses and hence the final 
expression becomes VA = E/E‘A;.;. However, to avoid confusion we put the vector to the left relying 
on this understanding. 
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This can be easily extended to higher rank tensors. As seen, the contravariant form of the 
gradient operation increases the contravariant rank of the tensor by one. 


6.2.3 Divergence 


Generically, the divergence of a differentiable contravariant vector A is defined as follows: 


V:-A=E‘,- (A’E;) (437) 
= E' - 0; (A’E;) 
—E'. (A,.E;) (definition of covariant derivative) 
- (E' ; E;) Al; 
= JL Al, (Eq. 215) 
= A’, (Eq. 172) 


In more details, the divergence of a differentiable contravariant vector A’ is a scalar ob- 
tained by contracting the covariant derivative index with the contravariant index of the 
vector, and hence: 


V-A=A', (438) 
= a, Ai +T%,A? (Eq. 362) 
= 0,Ai + AI si (/9) (Eq. 324) 
= 0;A' + Hs) (/9) (renaming dummy index j) 
VI 
= ae (/gA’) (product rule) 


V9 


where g is the determinant of the covariant metric tensor g,;. The last equality may be 
called the Voss-Weyl] formula. 

The divergence can also be obtained by raising the first index of the covariant derivative 
of a covariant vector using a contracting contravariant metric tensor, that is: 


G Ags = (9" Ay), (Eq. 376) (439) 
= (4’) , (Eq. 228) 
= A‘, 
=V-A (Eq. 438) 


as before. Accordingly, the divergence of a covariant vector A; is obtained by using the 
raising operator, that is: 


Al = 9 Ags (440) 
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For a rank-2 contravariant tensor A, the divergence is generically defined by: 


V-A=E'0;- (A“Ej;E;) (441) 
= E'- 0; (A”*E,E,) 
=E'. (AXE,E;) (definition of covariant derivative) 
= (6-8) B.A! 
= 5)B, Att (Eq. 215) 
= E, A" (Eq. 172) 


The components of this expression represent a contravariant vector, as it should be since 
the divergence operation reduces the contravariant rank of a tensor by one. 

More generally, considering the tensor components, the divergence of a differentiable 
rank-2 contravariant tensor A” is a contravariant vector obtained by contracting the co- 
variant derivative index with one of the contravariant indices, that is: 


[V- A] = A¥, or [VA] = A¥, (442) 
And for a rank-2 mixed tensor At we have: 
[V- A], = A}, (443) 


Similarly, for a general tensor of type (m,n): A = AR? 1 é a the divergence with respect 


to its k*” contravariant index is defined by: 


IV . A]?® see ees tm =. (Ae i (444) 


Jigas Jnl; 


with the absence of the contracted contravariant index 7; on the left hand side. As a 
matter of notation, it should be understood that V-A is lower than the original tensor 
A by just one contravariant index and hence, unlike the common use of this notation, it 
is not necessarily scalar. 


6.2.4 Curl 


The curl of a differentiable vector is the cross product of the nabla operator V with the 
vector. For example, the curl of a vector A represented by covariant components is given 
by: 
culA=VxA (445) 
= E’0; x A;E! 
= Ayes (E' x E’) 
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= Aj. €7*E,, (Eq. 274) 
= 6% AjcEx 
ck ; 
= (0,A; —T5,A1) Ex (Eqs. 166 & 361) 


V9 


Hence, the k” contravariant component of curl A is given by: 


ijk 


iVxaAr== = (0,A; —T,Ar) (446) 


On expanding the last equation for the three components of a vector in a 3D space, 
considering that the terms of the Christoffel symbols cancel out due to their symmetry in 
the two lower indices,!?8] we obtain: 


VxA t= i (Q2A3 a 03 A) (447) 
VxA — a (O3Ay = 0 A3) (448) 
VxA : = 7 (O; Ag = 02 Aj) (449) 


Hence, Eq. 446 will reduce to: 


[V x A]* = —@,A; (450) 


6.2.5 Laplacian 


Generically, the Laplacian of a differentiable scalar function of position, f, is defined as 
follows: 


Vf = div (grad f) = V- (Vf) (451) 


Hence the simplest approach for obtaining the Laplacian in general coordinate systems 
is to insert the expression for the gradient, Vf, into the expression for the divergence. 
However, because in general coordinate systems the divergence is defined only for tensors 
having at least one contravariant index to facilitate the contraction with the covariant 


[28] That is: 
Ajj — Aj = OJAI A,T i; OA; Ani 
= O;A;— AgTY; — O:A5 + An 


= Aig — Aja 
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derivative index (see for example Eqs. 437 and 444) whereas the gradient of a scalar is 
a covariant tensor, the index of the gradient should be raised first before applying the 
divergence operation, that is: 


Wil HO fea gozt (452) 
Now, according to Eq. 438 the divergence is given by: 
: 1 : 
V- A=A',;, = —0O; (/gA' 453 
= za. (vaA) 459) 


On defining A = E,0’f and replacing A’ in Eq. 453 with 0'f using Eq. 452 we obtain: 


which is the expression for the Laplacian of a scalar function f in general coordinate 
systems. 

Another approach for developing the Laplacian expression in general coordinate systems 
is to apply the first principles by using the definitions and basic properties of the operations 
involved, that is: 

Vf =V- (Vf) ee) 
= 80, (BO, 
= E' - 0; (B’0;f) 
= E' - 0; (E’f;) 


= E'- (B’ fj) (Eq. 359) 

= (B'-B’) fis 

= 9" fi (Eq. 214) 

= (9 fp): (Eq. 376) 

= (9° ;f) 

= 0: (9 Of) + (GY Of) Tis Eg, 302) 

= 0; (90; f) + (90; f) TS, (renaming dummy indices i & k) 
= 04 (9A) + (99) = OVD (Ea. 3824) 


VI 


= 7 [ 99: (9 O;f) + (9% O;f) O:n/G] (taking 7 factor) 


ee 
V9 


which is the same as before (see Eq. 454). 
The Laplacian of a scalar f may also be shorthand notated with: 


Vi = 9 fj (456) 


d; (99%; f) (product rule) 
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The Laplacian of non-scalar tensors can be similarly defined. For example, the Laplacian 
of a vector B in its contravariant and covariant forms, B’ and B,, is a vector A (i.e. 
A = V’B) which may be defined in general coordinate systems as: 


As indicated earlier, the Laplacian of a tensor is a tensor of the same rank and variance 
type. 


6.3. Orthogonal Coordinate System 


In this section, we state the main differential operators and operations in general orthog- 
onal coordinate systems. These operators and operations are special cases of those of 
general coordinate systems which were derived in § 6.2. However, due to the widespread 
use of orthogonal coordinate systems, it is beneficial to state the most important of these 
operators and operations although they can be easily obtained from the formulae of general 
coordinate systems. 

For clarity, general orthogonal coordinate systems are identified in this section by the 


coordinates (q',...,q") with unit basis vectors (qi,...,@n) and scale factors (hi,...,n) 
where: ios A 
ae q’ 
qi = dX one = dX hin ge (no sum on 7) (458) 
1/2 .. 97 -1/2 
Or dai \? Og’ 
Es En > | > (55) | ae 


In the last equations, x/ and e; are respectively the coordinates and unit basis vectors in 
a Cartesian rectangular system, and r = z’e; is the position vector in that system. We 
remark that in orthogonal coordinate systems the covariant and contravariant normalized 


basis vectors are identical, as established previously in § 3.3, and hence q’ = q; and 
e) = &;. 
6.3.1 Operators 
The nabla operator in general orthogonal coordinate systems is given by: 
qi O 
V= ae 460 
Dae (160) 


while the Laplacian operator, assuming a 3D space, is given by: 


3 
Se il O (hyhghg O 
= 461 
V hihghs D 39 (h;)° Oq' ( 6 ) 
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6.3.2 Gradient 


The gradient of a differentiable scalar f in orthogonal coordinate systems, assuming a 3D 
space, is given by: 


(462) 


3 
= qi Of — a OF _ a2 Of _ Gs Of 
Vy 7 Ds h; Og’ 7 hy Oq! ) hg Oq? h3 Og? 
6.3.3 Divergence 


The divergence in orthogonal coordinate systems can be obtained from Eq. 438. Since 
for orthogonal coordinate systems the metric tensor according to Eqs. 233-236 is diagonal 
with \/g = hyhzh3 in a 3D space and h,Ai = At (no summation) according to Eq. 144, 
the last line of Eq. 488 becomes: 


1 O = Ai 
V-A = goa (/gA’) (463) 


3 
1 0 [hyhohs >, 
= A’ 
hihghs d. Og’ ( hj ) 


- cE Ea (hots Ai )+ + (tisA2) + + (5) 


where A is a contravariant differentiable vector and A’ represents its physical components. 


This equation is the divergence of a vector in general orthogonal coordinates as defined in 
vector calculus. We note that in orthogonal coordinate systems the physical components 
are the same for covariant and contravariant forms, as established before in § 3.3, and 


hence A; = Ai. 


6.3.4 Curl 


The curl of a differentiable vector A in orthogonal coordinate systems in 3D spaces is 
given by: 
1 mdi hae hsas 
VxA= Fehehe qt ag age (464) 
soe Oi Tae SpA. sare 


where the hat indicates a physical component. The last equation may also be given in a 


more compact form as: 


igi lts O( (hy, Ax) : 
[V x A], “> haha Oe (no sum on 7) (465) 
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6.3.5 Laplacian 


As seen earlier, for general orthogonal coordinate systems in 3D spaces we have: 


V9 = hahah gf = ae (no sum) gi=0 (#3) (466) 


and hence Eq. 454 becomes: 
i wo GO hehe t 
5 ie ( 467 
is hihohs d. Og (h;)° Og ( ) 


which is the Laplacian of a scalar function of position, f, in orthogonal coordinate systems 
as defined in vector calculus. 


6.4 Cylindrical Coordinate System 


For cylindrical coordinate systems identified by the coordinates (p, ¢, z), the orthonormal 
basis vectors are e,, eg and e,. Although the given components (i.e. A,,Ag and A,) are 
physical components, we do not use hats since the components are suffixed with coordinate 
symbols (refer to § 3.3). We use for brevity e,, as a shorthand notation for the unit dyad 
e,e, and similar notations for the other dyads. We remark that the following expressions 
for the operators and operations in cylindrical coordinate systems can be obtained from the 
definition of these operators and operations in general coordinate systems using the metric 
tensor of the cylindrical system (see § 4.5). They can also be obtained more simply from 
the corresponding expressions in orthogonal coordinate systems using the scale factors of 
the cylindrical system in Table 1. 

It should be obvious that since p, @ and z are labels for specific coordinates and not 
variable indices, the summation convention does not apply to these labels. We note that 
cylindrical coordinate systems are defined specifically for 3D spaces. We also note that 
the following operators and operations can be obtained for the 2D plane polar coordinate 
systems by dropping the z components or terms from the cylindrical form of these operators 
and operations. 


6.4.1 Operators 


The nabla operator V in cylindrical coordinate systems is given by: 


1 
V =e,0, + eg—05 + €.0, (468) 
p 
while the Laplacian operator is given by: 
5 1 1 
V= Os a pales | O., (469) 


where 0,, = 0,0, and the notation equally applies to other similar symbols. 


6.4.2 Gradient 


6.4.2 Gradient 


The gradient of a differentiable scalar f is given by: 
1 
Vf=e,0,f + eo Oot + e,0,f 


The gradient of a differentiable vector A is given by: 


VA = CppAp.p + €pg-Ag,p + CpzAz,p + 


1 “*) (- “) 
e -A,g— Le A 
op (2 PsP p og p o,0 p 


Cs Any ni C2gAGs a GL Ay. 


where €p,p¢,°** »@zz are unit dyads as defined above. 


6.4.3 Divergence 


The divergence of a differentiable vector A is given by: 
ys! 


The divergence of a differentiable rank-2 tensor A is given by: 


App —A a) 
V-A = & (Ano + m4 Agog + Azp,z 
p p 
2A i Ago —A 
ey (Aron + P+ — Agg,g + Asg,2 + 2 
Hp 
Ax. 1 
ez (Ane + 4 Agz¢ 4 Ans) 
psp 


We note that Eq. 472 can be obtained by contracting Eq. 471. 


6.4.4 Curl 


The curl of a differentiable vector A is given by: 


1| Ce Pee e& 
VxA=-|0, & A, 
PA. GAg A, 


6.4.5 Laplacian 


The Laplacian of a differentiable scalar f is given by: 


1 i 
VP = Opa ied povot | Oo. f 
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(470) 


(471) 


(472) 


(473) 


(474) 


(475) 
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The Laplacian of a differentiable vector A is given by: 
. 1 i) 2 
WA = ey Op ae (pA,) + paleo Ap + OzzAp = poee + (476) 
1 1 2 
eg On pa (pAg) + plo As + O2zAg + pelo e + 


1 1 
e, 2, (p0,A.) + polos + 0..A.| 


6.5 Spherical Coordinate System 


For spherical coordinate systems identified by the coordinates (r,@,@), the orthonormal 
basis vectors are e,, €g and eg. As in the case of cylindrical coordinates, the components 
are physical and we do not use hats for the same reason. We use for brevity e,9 as 
a shorthand notation for the dyad e,eg and similar notations for the other unit dyads. 
As for cylindrical systems, the following expressions for the operators and operations in 
spherical coordinate systems can be obtained from the corresponding definitions in general 
coordinate systems using the metric tensor of the spherical system (see § 4.5) or from the 
corresponding expressions in orthogonal systems using the scale factors of the spherical 
system in Table 1. Again, the summation convention does not apply to r, 0 and ¢ since 
they are labels and not indices. We also note that spherical coordinate systems are defined 
specifically for 3D spaces. 


6.5.1 Operators 


The nabla operator V in spherical coordinate systems is given by: 


1 1 
V= e,.0,. T €6- Op T ris an d (477) 
while the Laplacian operator is given by: 
2 1 cos 6 1 
2 
= O,», + -—O, + sOoe 4 — 0» 4 ) 478 
. in pe eT 2 sing 9 ' p2gin2 °F ee 


where O,, = 0,0, and the notation equally applies to other similar symbols. 


6.5.2 Gradient 
The gradient of a differentiable scalar f in spherical coordinate systems is given by: 


1 


rsin 


1 
Vf = e, 0, f eo Op f + €g got (479) 
The gradient of a differentiable vector A is given by: 


VA = Caen + eo Agr + ergAgr + (480) 
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A,e Ag Ago A, Age 
se ror Boe ro Or SRO eo 


gj (= -*2) | [se - Sart | (= A, | wet) 
or r €go r gg : 


r sin @ r r sin 0 r r sin 9 r r 


where e€,,,€;9,°** ,@g¢ are unit dyads as defined above. 


6.5.3 Divergence 


The divergence of a differentiable vector A is given by: 


Ot Ae) O (sin 0 Ag) OAs 
-A= 7 481 
Vv r? sin 6 sin Or "80 : Oo oo 
The divergence of a differentiable rank-2 tensor A is given by: 
Vv : A = O,. (r?A,,) Op (Aor sin 0) Oo Agr _ Aoo + Age | (482) 
r2 rsin6@ r sin 6 Tr 
é O,. (r3A,@) Op (Apo sin 0) 0.A¢e ; Aor = Aro es Ags cot 7 
r3 rsind rsind | r 
é O,. (r3A,) Oo (Age sin 0) Og Age | Auge a ues + Ago cot 0 
¢ r3 rsind “ orsind — r 


We note that Eq. 481 can be obtained by contracting Eq. 480. 


6.5.4 Curl 
The curl of a differentiable vector A is given by: 


1 e, reg rsinéeg 
ee A, rAg rsindAg 


6.5.5 Laplacian 


The Laplacian of a differentiable scalar f is given by: 


2 1 cos 8 1 
Vif = Omf + SOnf + Gna + Goof + Sap Ooo (484) 


The Laplacian of a differentiable vector A is given by: 
V2A = @, a, € i) Op (sin 009 A,) | Og¢Ar a 205 (Ap sin 0) a 20,Ag 


| (485) 


r2 r2 sin 0 ' p2sin? 6 r2 sin 6 r2 sin 6 
é Or (r?0,Ag) 1 8 Op (Ag sin 0) Os¢Ao 209A, 7 2 cot 6 it 
: [a Dope? sin @ pr sin? r? rsino * * 


‘ = (r?0,Ag) 1 a (+ (Ag sin 2) Os¢Ag 20 Ay 2cot 6 | 


A 
0 ; T z T 7 1 5 0 
r2 r2 sin 9 r2sin?@ r2sin@  r2sin@ % 
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6.6 Exercises and Revision 


6.1 


6.2 


6.3 


6.4 


6.5 


6.6 


6.7 


6.8 


6.9 


6.10 


6.11 


6,12 


6.13 


6.14 


6.15 


6.16 


6.17 


6.18 


6.19 


Describe briefly the nabla based differential operators and operations considering the 
interaction of the nabla operator with the tensors which are acted upon by this oper- 
ator. 

What are the advantages and disadvantages of using the coordinates as suffixes for 
labeling the operators, basis vectors and tensor components in cylindrical and spher- 
ical systems instead of indexed general coordinates? What are the advantages and 
disadvantages of the opposite? 

“The differentiation of a tensor increases its rank by one, by introducing an extra 
covariant index, unless it implies a contraction in which case it reduces the rank by 
one”. Justify this statement giving common examples from vector and tensor calculus. 
Write the following subsidiary nabla based operators in tensor notation: A - V and 
A x V. Is this notation consistent with the notation of dot and cross product of 
vectors? 

Why in general we have: A-VAV-AandAxV#V x A? 

Define the nabla vector operator and the Laplacian scalar operator in Cartesian co- 
ordinate systems using tensor notation. 

Find the gradient of the following vector field in a Cartesian coordinate system: A = 
(2a). 

Define the divergence of a differentiable vector descriptively and mathematically as- 
suming a Cartesian coordinate system. 

What is the divergence of the following vector field in Cartesian coordinates: A = 
(Qe y° 67)? 

Write symbolically, using tensor notation, the following two forms of the divergence 
of a rank-2 tensor field A in Cartesian coordinates: V- A and V- A’. 

Define the curl V x A in Cartesian coordinates using tensor notation where (a) A 
is a rank-1 tensor and (b) A is a rank-2 tensor (note the two possibilities in the last 
case). 

What is the curl of the following vector field assuming a Cartesian coordinate system: 
Ae [5e"? ary. 27)? 

Find the Laplacian of the following vector field in Cartesian coordinates: A = 
(x?y, 2y sin z, mzeCh®), 

Define the nabla operator and the Laplacian operator in general coordinate systems 
using tensor notation. 

Obtain an expression for the gradient of a covariant vector A = A,E’ in general coordi- 
nates justifying each step in your derivation. Repeat the question with a contravariant 
vector A = A’E;. 

Repeat question 6.15 with a rank-2 mixed tensor A = AL E,E!. 

Define, in tensor language, the contravariant form of the gradient of a scalar field. 
Define the divergence of a differentiable vector descriptively and mathematically as- 
suming a general coordinate system. 

Derive the following expression for the divergence of a contravariant vector A in 
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6.20 


6.21 


6.22 


6.23 


6.24 


6.25 


6.26 


6.27 


6.28 


6.29 


6.30 


6.31 


6.32 
6.33 


6.34 


6.35 


6.36 


6.37 


6.38 


6.39 


general coordinates: V- A = A',. 
Verify the following formula for the divergence of a contravariant vector A in general 
coordinates: V- A = Tq (/g4’). Repeat the question with the formula: V-A = 


g' Aj; where A is a covariant vector. 

Repeat question 6.20 with the formula: V-A = BE, A‘* where A is a rank-2 con- 
travariant tensor. 

Prove that the divergence of a contravariant vector is a scalar (i.e. rank-0 tensor) by 
showing that it is invariant under coordinate transformations. 

Derive, from the first principles, the following formula for the curl of a covariant vector 
field A in general coordinates: [V x A]* = — (0,A; —T4,Ai). 

Show that the formula in exercise 6.23 will reduce to [V x A]* = OA; due to the 
symmetry of the Christoffel symbols in their lower indices. 

Derive, from the first principles, the following expression for the Laplacian of a scalar 
field f in general coordinates: V?f = Foi (,/99"0;f). 

Why the basic definition of the Laplacian of a scalar field f in general coordinates 
as V?f = div (grad f) cannot be used as it is to develop a formula before raising the 
index of the gradient? 

Define, in tensor language, the nabla operator and the Laplacian operator assuming 
an orthogonal coordinate system of a 3D space. 

Using the expression of the divergence of a vector field in general coordinates, obtain 
an expression for the divergence in orthogonal coordinates. 

Define the curl of a vector field A in orthogonal coordinates of a 3D space using 
determinantal form and tensor notation form. 

Using the expression of the Laplacian of a scalar field in general coordinates, derive 
an expression for the Laplacian in orthogonal coordinates of a 3D space. 

Why the components of tensors in cylindrical and spherical coordinates are physical? 
Define the nabla and Laplacian operators in cylindrical coordinates. 

Use the definition of the gradient of a scalar field f in orthogonal coordinates and 
the scale factors of Table 1 to obtain an expression for the gradient in cylindrical 
coordinates. 

Use the definition of the divergence of a vector field A in orthogonal coordinates and 
the scale factors of Table 1 to obtain an expression for the divergence in cylindrical 
coordinates. 

Write the determinantal form of the curl of a vector field A in cylindrical coordinates. 
Use the definition of the Laplacian of a scalar field f in orthogonal coordinates and 
the scale factors of Table 1 to obtain an expression for the Laplacian in cylindrical 
coordinates. 

A scalar field in cylindrical coordinates is given by: f (p,¢,z) = p. What are the 
gradient and Laplacian of this field? 

A vector field in cylindrical coordinates is given by: A (p, 6,2) = (32,77, z? cos p). 
What are the divergence and curl of this field? 

Repeat exercise 6.33 with spherical coordinates. 
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6.40 Repeat exercise 6.34 with spherical coordinates. 

6.41 Repeat exercise 6.35 with spherical coordinates. 

6.42 Repeat exercise 6.36 with spherical coordinates. 

6.43 A scalar field in spherical coordinates is given by: f (r,6,¢) =r? +0. What are the 
gradient and Laplacian of this field? 

6.44 A vector field in spherical coordinates is given by: A (r,6,¢) = (e’, 5sin@, In). What 
are the divergence and curl of this field? 


Chapter 7 
Tensors in Application 


In this chapter, we conduct a preliminary investigation about some tensors and tensor 
notation and techniques which are commonly used in the mathematical and physical ap- 
plications of tensor calculus. The chapter is made of three sections dedicated to tensors in 
mathematics, geometry and science. The mathematics part (see § 7.1) comes from tensor 
applications in linear algebra and vector calculus, while most of the materials in the sci- 
ence part (see § 7.3) come from applications related to fluid and continuum mechanics. In 
the geometry part (see § 7.2) a few prominent tensors and tensor identities of wide appli- 
cations in differential geometry and related scientific fields are examined. We should also 
refer the reader to the previous chapter (see § 6) as an example of tensor applications in 
mathematics and science since the materials in that chapter are partly based on employing 
tensor notation and techniques. 

We note that all the aforementioned disciplines, where the materials about tensors come 
from, are intimately linked to tensor calculus since large parts of this subject were devel- 
oped within those disciplines. Also, the materials presented in this chapter about tensors 
and tensor techniques are used as vital building blocks and tools in a number of important 
mathematical and physical theories. However, we would like to insist that although these 
materials provide a very useful glimpse, they are just partially representative examples of 
tensor applications. Our objective is to have more familiarity with some prominent tensors 
and tensor techniques and hence they are not meant to provide a comprehensive view. We 
should also indicate that some tensors, especially those in the science section, are defined 
differently in different disciplines and hence the given definitions and descriptions may not 
be thorough or universal. 


7.1 Tensors in Mathematics 


In this section, we provide a sample of common definitions related to basic concepts and 
operations in matrix and vector algebra using the language of tensor calculus. Common 
identities in vector calculus as well as the integral theorems of Gauss and Stokes and 
some important scalar invariants are also examined in this section from this perspective. 
Finally, we provide a rather extensive set of examples about the use of tensor language 
and techniques in proving mathematical identities where these identities are gathered 
from vector calculus. For simplicity, clarity and wide applicability we employ a Cartesian 
approach in the tensor formulations of this section. 
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7.1.1 Common Definitions in Tensor Notation 


The trace of a matrix A representing a rank-2 tensor in an nD space is given by: 


For a 3 xX 3 matrix representing a rank-2 tensor in a 3D space, the determinant is given 


by: 


Ay, Aj. Ais 
det (A) =| Ag, Agog Ag3 | = €ijr Ari Any Aak <= €ijr Ai AjzAr3 (487) 
Az, Azo A33 


where the last two equalities represent the expansion of the determinant by row and by 
column. Alternatively, the determinant of a 3 x 3 matrix can be given by: 


i 
det (A) = 3 Siskel Ait AjmAin (488) 


More generally, for an n x n matrix representing a rank-2 tensor in an nD space, the 
determinant is given by: 


det (A) = Gin Ata hee Ane. (489) 
= Casa oes Ain 
i 


= a yfiamin in Aina «+» Ainin 


The inverse of a matrix A representing a rank-2 tensor is given by: 
1 
—1 = é : 
[A ls =2 0 yet (A) €ipg €jmn AmpAng (490) 
The multiplication of a matrix A by a vector b, as defined in linear algebra, is given by: 
[Ab]; = Aid; (491) 


It should be remarked that we are using matrix notation in the writing of Ab. According 
to the symbolic notation of tensors, the multiplication operation should be denoted by a 
dot between the symbols of the tensor and the vector, i.e. A-b, since matrix multiplication 
in matrix algebra is equivalent to an inner product operation in tensor algebra. 
Similarly, the multiplication of two compatible matrices A and B, as defined in linear 
algebra, is given by: 
[AB], = Aig Bix (492) 


Again, we are using here matrix notation in the writing of AB; otherwise a dot should be 
inserted between the symbols of the two matrices. 
The dot product of two vectors of the same dimension is given by: 
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Similarly, the cross product of two vectors in a 3D space is given by: 
[A x B], = jh Aj Br (494) 


The scalar triple product of three vectors in a 3D space is given by: 


A, Ag A3 
A. (B x C) = By Bo Bz = Ei AG DiC ye (495) 
Cy Cy C3 


while the vector triple product of three vectors in a 3D space is given by: 
[A x (B x C)], = €ijx€kimAj Bim (496) 


The expression of the other principal form of the vector triple product, i.e. (A x B) x C, 
can be obtained from the above form by changing the order of the factors in the external 
cross product and reversing the sign. Other operations, like relabeling the indices and 
exchanging some of the indices of the epsilons with a shift in sign, can then follow to 
obtain a more organized form. 


7.1.2 Scalar Invariants of Tensors 


In the following, we list and write in tensor notation a number of invariants of low rank 
tensors which have special importance due to their widespread applications in vector and 
tensor calculus. All these invariants are scalars. 

The value of a scalar (rank-0 tensor), which consists of a magnitude and a sign, is 
invariant under coordinate transformations. An invariant of a vector (rank-1 tensor) under 
coordinate transformations is its magnitude, i.e. length.?9! The main three independent 
scalar invariants of a rank-2 tensor A are: 


TIT = tr(A®*) = AyAjeAni (499) 


Different forms of the three invariants of a rank-2 tensor A, which are also widely used, 
are the following (noting that some of these definitions may belong to 3D specifically): 


1 if 
Ts Se (I? — II) = 5 (AyAj; — AijAji) (501) 
if i! 
Iz; = det(A)= al (1° — 32 II +2111) = 3 Ck epar AipAjq Arr (502) 


9] The direction is also invariant but it is not a scalar! In fact the magnitude alone is invariant under 
coordinate transformations even for pseudo vectors because it is a true scalar. 
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where J, [J and III are given by Eqs. 497-499. As the invariants J,, [2 and /3 are defined 
in terms of the invariants J, JJ and III according to Eqs. 500-502, the invariants J, IJ 
and III can also be defined in terms of the invariants [,, [2 and J3, that is: 


PS (503) 
WS S265 (504) 
HES f= Ship 3, (505) 


Since the determinant of a matrix representing a rank-2 tensor is invariant (see Eq. 502), 
then if the determinant vanishes in one coordinate system, it will vanish in all transformed 
coordinate systems, and if not it will not (also refer to § 3.1.4). Consequently, if a rank-2 
tensor is invertible in a particular coordinate system, it will be invertible in all coordinate 
systems, and if not it will not. 

The following are ten common scalar invariants that are jointly formed between two rank- 
2 tensors A and B: tr(A), tr (B), tr (A’), tr (B’), tr (A%), tr (B®), tr(A- B), tr (A? -B), 
tr (A - B?) and tr(A?-B?). As seen, all these are traces. 


7.1.3 Common Identities in Vector and Tensor Notation 


In this subsection, we present some of the widely used identities of vector calculus using 
the traditional vector notation as well as its equivalent tensor notation. In the following 
equations, f and h are differentiable scalar fields; A, B, C and D are differentiable vector 
fields; and r = 2;e; is the position vector. 


Ver =n (vector notation) (506) 
Of; Sn (tensor notation) (507) 


where n is the space dimension. 


Vxr = 0 (508) 
CGLOjL = 0 (509) 
V(a-r) =a (510) 
O; (a;2;) = aj (511) 

where a is a constant vector. 

VulV i) = WF (512) 
O(Af) = if (513) 
V:(VxA) = 0 (514) 


€;j;h0;0; Ak = 0 (515) 
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Yew) 
€ijh0jOnf = 0 


V(fh) = fVh+havef 
0; (fh) = fO;h a hd; f 


V-(fA) = fV- A+A-Vf 
O(fAi:) = foA;+ Adi f 


Vx(fA) = fVxA4+VfxA 
€ijkO; (fAn) = feigrOj;An + €ijn (O;f) Az 


A-(Bx C)=C-(A x B) =B: (Cx A) 
eRALD Cr = Ch CRAB = Ej Dj ChA; 


Ax(BxC) = B(A-C)—C(A-B) 
€ijhAj€kimBiCm = Bi (AmCm) — Ci (ALB) 


Ax(VxB) = (VB)-A—A-VB 
€ijk€kimAjOLBm = (0;Bm) Am — Ai (O;B:) 


Vx(VxA) = V(V-A)-V’A 


€ijk€klmOj1Am 


V(A-B) = Ax(VxB)+Bx(VxA)+(A-V)B+(B-V)A 


0;(AmBm) = €ijnAj (€tmO1Bm) + €ijn-By (€ktmOAm) + (ArO;) Bi + (Bi0;) A 


V-(AxB) = B-(VxA)—A-(V xB) 
0; (€:j;rAj Br) = By, (€4:j0;A;) = Ay (€;:0; Br) 


Vx(AxB) = (B-V)A+(V-B)A—(V-A)B-(A-V)B 
€ijk€kImO; (ArBm) = (BmOm) Ai + (OmBm) Ai — (OjA3) Bi — (AjO;) Bi 


A-C A-D 
(Ax B)-(C xD) = ee aaa 
jr Aj BréimCiDm = (AiCi) (BmDm) — (AmPm) (BiCi) 
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(A x B) x (C xD) [D -(A x B)|C—[C- (A x B)|D (540) 
Cake jit neipd Hs = (Coed aaa) C; cam (GimnCpAmbn) D; (541) 


In vector and tensor notations, the condition for a vector field A to be solenoidal is given 
by: 

V-A = 0 (542) 
In vector and tensor notations, the condition for a vector field A to be irrotational is given 


by: 


VxA = 0 (544) 
€;j40; Ak (545) 


I 
= 


7.1.4 Integral Theorems in Tensor Notation 


The divergence theorem for a differentiable vector field A in vector and tensor notations 
is given by: 


— 
s) 
> 
Q 
4 
| 
== 
> 
5 
Q 
9 


(546) 


Q Ss 


where 2) is a bounded region in an nD space enclosed by a generalized surface S, dr and 
do are generalized volume and area differentials, n and n; are the unit vector normal to 
the surface and its 7” component, and the index 7 ranges over 1,...,n. 

Similarly, the divergence theorem for a differentiable rank-2 tensor field A in tensor 
notation for the first index is given by: 


Q S 


while the divergence theorem for differentiable tensor fields of higher rank A in tensor 
notation for the index k is given by: 


| erAistomir =f Aig s.mmda (549) 
Q S 


Stokes theorem for a differentiable vector field A in vector and tensor notations is given 


by: 
[fv xa) nao = | A-d (550) 


S C 
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where C' stands for the perimeter of the surface S, and dr is a differential of the position 
vector which is tangent to the perimeter while the other symbols are as defined above. 

Similarly, Stokes theorem for a differentiable rank-2 tensor field A in tensor notation for 
the first index is given by: 


[cndsAunido = ff Aude, (552) 
Ss Cc 


while Stokes theorem for differentiable tensor fields of higher rank A in tensor notation 
for the index k is given by: 


| €ijkOj Aim...k..nMiGdo = [ Aim...k..n dX k (553) 
S Cc 


7.1.5 Examples of Using Tensor Techniques to Prove Identities 


In this subsection, we provide some examples for using tensor techniques to prove vec- 
tor and tensor identities where comments are added next to each step to explain and 
justify. These examples, which are based on the identities given in § 7.1.3, demonstrate 
the elegance, efficiency and clarity of the methods and notation of tensor calculus. We 
note that in Cartesian coordinate systems, some tensor equations in general coordinates 
change their form and hence in the following we give the corresponding Cartesian form of 
the equations which are not given previously in this form since we use the Cartesian form 
in these proofs. The added comments inside the parentheses refer to the corresponding 
equations in general coordinates: 


€ijkElmk = di10 jm — OpOy (Eq. 167) (554) 
OX; 
an Oj0; = ti5 = Oi (Eq. 175) (556) 
J 
0,2; = 64 =n (Eq. 176) (557) 
eV-r=n 
V-r=0;2; (Eq. 419) 
= Oi (Eq. 557) 
= (Eq. 557) 
eVxr=0: 
[V x r], = CEO; T (Eq. 423) 
= €ijkOkj (Eq. 556) 
= €ij5 (Eq. 555) 
= 0 (Eq. 155) 


Since 7 is a free index, the identity is proved for all components. 
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e V (a-r) =a: 
[V (a-r)], = 0; (a;x;) 
= az0a + 0i0s 


= ajO;2 ; 
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Eqs. 417 & 493) 
product rule) 

a; is constant) 
Eq. 556) 

Eq. 555) 


definition of index) 


i a a i a 


Since 7 is a free index, the identity is proved for all components. 


eV: (Vf) = Vf: 
V- (Vf) = alVAl; 
= 0; (0;f) 
= 0;0;f 
= Ouf 
—V'f 
eV-(V x A) =0: 

V-(V x A) =90;[V x A], 
= 0; (€:jnOj Ar) 
= €;j,0;:0; Ak 
= €;j,0j;0; Ax 
= —€;j,0j0; Ax 
= —€;4;,0;0; Ax 
= 0 


(Eq. 419) 

(Eq. 417) 

(rules of differentiation) 
(definition of 2nd derivative) 
(Eq. 425) 


Eq. 419) 
Eq. 423) 


€ is constant) 


Eq. 180) 
relabeling dummy indices 7 and 7) 


( 
( 
( 
(continuity condition) 
( 
( 
(since €;;,0;0; A, = —€;;~0;0; Ax) 


This can also be concluded from line three by arguing that: since by the continuity con- 
dition 0; and 0; can change their order with no change in the value of the term while a 
corresponding change of the order of 7 and 7 in €,;, results in a sign change, we see that 


each term in the sum has its own negative and hence the terms add up to zero. 


eV x (Vf) =90: 

[V x (VF)], = €:j40; [VF], (Eq. 423) 
= €ijh0; (Oxf) (Eq. 417) 
= €4;n0;0% f (rules of differentiation) 
= 640,037 (continuity condition) 
= —€7j0,0; f (Eq. 180) 
= —€:;,0j;Onf (relabeling dummy indices j and k) 
= 0 (since €;;,0;0%f = —€:;,0;Of) 


This can also be concluded from line three by a similar argument to the one given in the 
previous point. Because [V x (V/f)], is an arbitrary component, then each component is 
Zero. 
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V (fh) = fVA+AVE: 
IV (Fh)], = 0: (Fh) 
= fo;h+ hd; f 
= [fVh];, + [AVA]; 
= |[fVh+hVf], 


Eq. 417) 
product rule) 
Eq. 417) 


( 
( 
( 
(Eq. 24) 


Because 7 is a free index, the identity is proved for all components. 


V-(fA)=fV-A+A-Vf: 
V - (fA) = Or [fAl; 
= 0; (fAi) 
= foA; + Aidif 
=fV-A+A-Vf 


eVx(fA)=fVxA+VfXxA: 
[V x (fA)]; = eijn0; [FA], 
= €:jn0; (FAR) 
cae f€ij,0j; Ar T Eijk (O;f) Ay 
= feijnOj An + €ijx [VF]; Ak 
=[fV x A]; + [Vf x A]; 
=|fVxA+Vf x Al, 


Eq. 419) 
definition of index) 
product rule) 

Eqs. 419 & 413) 


~~ an 


Eq. 423) 

definition of index) 

product rule & commutativity) 
Eq. 417) 

Eqs. 423 & 494) 


( 
( 
( 
( 
( 
(Eq. 24) 


Because 7 is a free index, the identity is proved for all components. 


eA-(BxC)=C-(A x B) =B-(Cx A): 
A; (B x C) = jp AB, Cy 
= €4ij A;BjCh 
= Eig CrAiB; 
=C-(A xB) 
= eA DB; Cp 
€jki Dj CRA; 
=B-(Cx A) 


The negative permutations of this identity can be similarly obtained and proved by chang- 
ing the order of the vectors in the cross products which results in a sign change. 


© Ax (Bx C)=B(A-C)—C(A-B): 
[A x (B x C)]; = eijn Aj [B x C], 
= €ijnrAj€ximBiCm 
= ener Agim 
= eR ceinik Ay Pim 


= (did jm = JimOjt) A; BiCm 


= 6:15jmAjBiCm — 5imd;tAjBiCm 


= (6B) (6jmAjCm) — (OimCm) (6A; Bi) 


= B;(AmCm) — C; (A,By) 
= B(A-C)—C;,(A-B) 
= [B(A-C)]; —[C(A- B)]; 
= [B(A-C)—C(A-B)], 
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distributivity) 
commutativity & grouping) 
Eq. 555) 

Eq. 493) 

definition of index) 

Eq. 24) 


Tr i a a i aN 


Because 7 is a free index, the identity is proved for all components. The other variant 
of this identity, ie. (A x B) x C, can be obtained and proved similarly by changing the 
order of the factors in the external cross product with adding a minus sign. 


e Ax (V x B) =(VB):-A—A.- VB: 


[A x (V x B)], = 6344; [V x B], 
= 6 jrApcamOBa 
= ik Cis OLP ns 


= CpkeimbAGADm 


(di0jm — imdj1) AjOLBm 

=O Oi AG Op Dye = On 0 pacOp Dan, 
= AnO;Bm — AiOB; 

= (0;:Bm) Am — Ai (0; Bi) 

= [(VB) : A]; iad [A : VB); 

= |(VB)-A—A- VB], 


commutativity & grouping) 
Eq. 418 & § 3.2.5) 


Because 7 is a free index, the identity is proved for all components. 


eVx(VxA)=V(V-A)—-V°A: 
[V x (V x A)]; = ijn; [V x A], 


= 640i) 
a= €ijk€klmO; (Am) 
= Si iikOOra 
= (6205m — OimOjt) O;0iAm 

= 01072001 Am — Om i0;0l An 
= OmO0;Am — O,O;A; 

= 0; (OmAm) — OA; 

_ [V (V . A)]; = [V*A] 
=|V(V-A)-V’A] 


a 


z 


Eq. 423) 

Eq. 423) 

€ is constant) 

Eq. 180 & definition of derivative) 
Eq. 554) 

distributivity) 

Eq. 555) 

O shift, grouping & Eq. 2) 

Eqs. 419, 417 & 426) 


Because 7 is a free index, the identity is proved for all components. This identity can 
also be considered as an instance of the identity before the last one, observing that in the 
second term on the right hand side the Laplacian should precede the vector, and hence no 


independent proof is required. 
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eV(A-B)=Ax(VxB)+Bx(VxA)+(A-V)B+(B-V)A: 
We start from the right hand side and end with the left hand side: 
[A x (V x B)+Bx (V x A)+(A-V)B+(B-V) A], = 


) 


[A x (V x B)], + [B x (V x A)],+ [(A- V) BJ; + [(B- V) A]; = 


a 


(Eq. 24) 


€ijnAj [V X B], + €:32B; [V x A], + (A10;) B; + (Bid) Ai = 
(Eqs. 494, 419 & indexing) 
€ijkAj (€ximO1Bm) + €ijn-B; (€kimOAm) + (A10;) Bi + (BiO;) Ai = 
(Eq. 423) 
€ijn€kimAjOLBm + €ijk€nimBjO1Am + (AiO;) Bi + (Bids) Ai = 
(commutativity) 
€ijh€lmkAjBm + €ijk€imkBjOAAm + (AiO) Bi + (BiO)) Ai = 
(Eq. 180) 
(:5jm — 5im9jt) Aj Bm + (bi0jm — Oimdjt) BjOj:Am + (Ar0;) B; + (BiO)) Ai = 
(Eq. 554) 


(5:10jmAjOBm — bimOjtAj;O1Bm) + (6:10jmBjAAm — bimdjtBj)Am) 
+ (A),0;) B; + (B,0;) Ai = 
(distributivity) 
6:90jmAj;O.Bm — AiO:B; + 6:16jmBj;QAm — BiOAi + (AiO) By + (.Bi0;) Ai = 
(Eq. 555) 
6:9 jmAj0:Bm — (AiO;) Bi + 610jmBjQAm — (BiO;) Ai + (ArO;) Bi + (Bis) Ai = 
(grouping) 
O50 nA O1 Bag 0109 BAO AG = 
(cancellation) 
Anim t+ BpOAm = 
(Eq. 555) 
8: (AmBm) = 
(product rule) 
[V (A : B)); 
(Eqs. 417 & 419) 


Because 7 is a free index, the identity is proved for all components. 


eV-(AxB)=B-(V x A)—A-(V xB): 


V -(A x B) = 0; [A x B], (Eq. 419) 
= 0; (€:j,A;Br) (Eq. 494) 
= €i540; (A; Br) (€ is constant) 
= €ijn (BROjA; + AjO;Bx) (product rule) 
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= Epp BiO Ay + eA (distributivity) 
= ex BEOWAg — eipAjOi Bi (Eq. 180) 
= Br (gs) =A (Ga 02k) (commutativity & grouping) 
= B[V x A], — A; [V x B], (Eq. 423) 
=B-(VxA)—A-(V xB) (Eq. 493) 
eVx(AxB)=(B-V)A+(V:-B)A-(V-A)B-(A-V)B: 
[_V x (A x B)]; = 640; [A x B], (Eq. 423) 
= 960; (Chim Pn) (Eq. 494) 
= ik€kimO; (AiPm) (€ is constant) 
= €ijk€kim (BmO0; Ai + AO; Bm) (product rule) 
= Gyktimng (PaO Art Ao, Ba) (Eq. 180) 
= (6i0jm — Simo) (BmOj;A1 + AO; Bm) (Eq. 554) 
= O10; BaO;Al 010m AIO; Pm 
bimOjtBmO;A1 — imdj1A10; Bm (distributivity) 
= BiOmAi + AiOmBm — BiO;A; — -Aj0;Bi; (Eq. 555) 
= (BmOm) Ai + (OmBm) Ai — (0;A;) Bi — (Aj;0;) Bi (grouping) 
= [((B-V) AJ, + [(V -B) ve 
[((V - A) B], — [((A- V) BI, (Eqs. 413 & 419) 
=(B-V)A+(V-B)A-—(V-A)B-(A-V)BJ], (Eq. 24) 


Because 7 is a free index, the identity is proved for all components. 
A-C A-D 
B-C B-D 
(A x B)-(C x D) = [A x BJ, [C x Dj, 
= €j4A;Br€imC,D 
= €ijn€ilmAj BrCiD 
= (4;154m — OjmOn) A; ByCLD 
= 0;10kmAjBECIDm — OjmdnAjBrCiDm 


e (Ax B)-(Cx D)= 


= (6;.A;C;) praee — (0jmA;Dm) (51 BeCr) 
= (A\C;) (ee Dyn) = (AmDm) (BiCi) 
= (A-C)(B-D)—(A-D)(B-C) 
A-C A-D 
-| B-C B-D 2] 
e (A x B) x (C x D) = [D- (A x B)| C— [C- (A x B)|D: 
[((A x B) x exp. = ijk [A x BJ, [C x D], 


=F = eke mma nein pby 


(Eq. 493) 

(Eq. 494) 
(commutativity) 
(Eqs. 180 & 554) 
(distributivity) 
(commutativity) 
(Eq. 555) 

(Eq. 493) 


(definition) 


(Eq. 494) 
(Eq. 494) 
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= Cy bebsae mmm Pa CE bd 

= Cighepakoma am Pal pe 

= (dipdjq — Fig5jp) €jmnAmBnCpDg 

= (dipdjq€jmn — 9igdjp€jmn) AmBnCpDg 

= (Sip€gmn — Sigpmn) AmBnCpDg 

SO seqnin Anne = tema pL 
=m biG Ds = Gran Papi 

= Cipla Adin Cs = conn alin 
(Cpt oAniBn) Ce (opie pAmien) Dy 
[D - (A x B)| C; —[C- (A x B)| D; 
([D- (A x B)]C], - [IC -(A x B)]DI, 
= [[D-(A x B)|C-[C-(A x BDI, 


commutativity) 


distributivity) 
distributivity) 


commutativity) 


grouping) 


index definition) 


( 
( 
( 
( 
( 
( 
( 
( 
( 
( 
( 
(Eq. 24) 


Because 7 is a free index, the identity is proved for all components. 


7.2 Tensors in Geometry 


In this section, we give some examples of tensor calculus applications in geometry. These 
examples are related mainly to the investigation of the properties of spaces in general and 
space curvature in particular and hence they play important roles in non-Euclidean geome- 
tries and related applications in the physical sciences such as geometry-based gravitational 
theories. 


7.2.1 Riemann-Christoffel Curvature Tensor 


This absolute rank-4 tensor, which is also called Riemann curvature tensor and Riemann- 
Christoffel tensor, is a property of the space. It characterizes important properties of 
spaces and hence it plays an important role in geometry in general and in non-Euclidean 
geometries in particular with many applications in geometry-based physical theories. The 
tensor is used, for instance, to test for the space flatness (see § 2.1). As indicated before, 
the Riemann-Christoffel curvature tensor vanishes identically 2ff the space is globally flat 
and hence the Riemann-Christoffel curvature tensor is zero in Euclidean spaces. The 
Riemann-Christoffel curvature tensor depends only on the metric which, in general co- 
ordinate systems, is a function of position and hence the Riemann-Christoffel curvature 
tensor follows this dependency on position. Yes, for affine coordinate systems the metric 
tensor is constant and hence the Riemann-Christoffel curvature tensor vanishes identically. 

There are two kinds of Riemann-Christoffel curvature tensor: first and second, where the 
first kind is a type (0, 4) tensor while the second kind is a type (1,3) tensor. The Riemann- 
Christoffel curvature tensor of the first kind, which may also be called the covariant (or 
totally covariant) Riemann-Christoffel curvature tensor, is defined by: 


Riga = Ox [91,2] — O[9k, a] + [il, 7] Uy, — (0k, er] TY (558) 
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1 
as (W059 + AO:9;% — O-0i9;1 — HO; 9xi) + (tl, 7] , — (tk, r] Thy 

1 : . . : 
a (O,0; ui “i O09 jk = O,0;951 = O10; Ii) st! ([2, r| Lk, s| = lik, r| gl, s}) 


2 


where the second step is based on Eq. 355 while the third step is based on Eq. 308. We 
note that the first line of Eq. 558 can be cast in the following mnemonic determinantal 
form: 

Or O; Dik M 
Lk] [gl] tk, r] [alr] 


Similarly, the Riemann-Christoffel curvature tensor of the second kind, which may also 
be called the mixed Riemann-Christoffel curvature tensor, is given by: 


ge | (559) 


We note again that Eq. 560 can be put into the following mnemonic determinantal form: 


O, A 


bo Di Ute 
re eel Pee Tey 


rl 


Rint a | 


‘ 


(561) 


On lowering the contravariant index of the Riemann-Christoffel curvature tensor of the 
second kind, the Riemann-Christoffel curvature tensor of the first kind is obtained, that 
is: 

Rij = Jia kr Kt (562) 
Similarly, the Riemann-Christoffel curvature tensor of the second kind can be obtained by 
raising the first index of the Riemann-Christoffel curvature tensor of the first kind, that 
is: 

Rj = 9 Rage (563) 

One of the main applications of the Riemann-Christoffel curvature tensor in tensor cal- 
culus is demonstrated in its role in tensor differentiation. As seen in § 5.2, the covariant 
differential operators in mixed derivatives are not commutative and hence for a covariant 
vector A we have: 


Ajjnt — Ajak = Rij) Ai (564) 


From Eq. 564, it is obvious that the mixed second order covariant derivatives are equal 
iff the Riemann-Christoffel curvature tensor of the second kind vanishes identically which 
means that the space is flat and hence the covariant derivatives are ordinary partial deriva- 
tives. Similarly, for the mixed second order covariant derivative of a contravariant vector 
A we have: ; 

Arey = Av — Rin” (565) 
which is similar to Eq. 564 for a covariant vector A. In brief, the covariant differential 
operators become commutative when the metric makes the Riemann-Christoffel curvature 
tensor of either kind vanish. So, since the Riemann-Christoffel curvature tensor is zero in 
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Euclidean spaces, the mixed second order covariant derivatives, which become ordinary 
partial derivatives, are equal when the C? continuity condition is satisfied. 

The Riemann-Christoffel curvature tensor of the first kind satisfies the following sym- 
metric and skew-symmetric relations in its four indices: 


Rijn = Rei (block symmetry) (566) 
= Hag (anti-symmetry in the first two indices) 
= =A (anti-symmetry in the last two indices) 


The skew-symmetric property of the covariant Riemann-Christoffel curvature tensor with 
respect to the last two indices also applies to the mixed Riemann-Christoffel curvature 
tensor, that is: 

Rit = Rix (567) 
As a consequence of the two anti-symmetric properties of the covariant Riemann-Christoffel 
curvature tensor, the entries of the Riemann-Christoffel curvature tensor with identical val- 
ues of the first two indices (e.g. Rj.) or/and the last two indices (e.g. Rjj,,) are zero 
(refer to § 3.1.5). Also, as a consequence of these anti-symmetric properties, all the entries 
of the tensor with identical values of more than two indices (e.g. Rjj;;) are zero. 

We remark that all the above symmetric and anti-symmetric properties of the Riemann- 
Christoffel curvature tensor of the first and second kinds can be proved by using the above 
definitions of this tensor. For example, the first anti-symmetric property can be verified 
as follows: 


1 : ; : . 
Lge a 5) (OnOi guj a O10; Gik _ OnO; Git _ O10; 9x;) eg © ([7l, r| [7k, 5] — [jk, r| [20, s|) (568) 


1 . : . ‘ 
ie E (O,O;Ga + HOGe; — OOiG13 — WO; Gin) + 9"° (9k, 7] lil, 8] — [71,7] [tk, s ) 


1 
— E (O10; 9 + O09 jk = OpOi9;1 _ O10; 9xi) ge ( al, 5| Lik, r| — |tk, $] Lil, r ) 


1 
— E (0,0; 911 + O10:9;% — On0i951 — O10; 9%i) + 9”° ([tl, r] [Jk, 5] — [tk, r] [yl, 5 ) 
= Rijn 


where in the first line we use the third line of Eq. 558 with exchanging 7 and 7, in the 
second line we take a factor of —1, in the third line we use the symmetry of the metric 
tensor in its two indices, in the fourth line we relabel two dummy indices, and in the last 
line we use the third line of Eq. 558 again. 

In an nD space, the Riemann-Christoffel curvature tensor has n* components. As a 
consequence of the symmetric and anti-symmetric properties of the covariant Riemann- 
Christoffel curvature tensor, there are three types of distinct non-vanishing entries: 

1. Entries with only two distinct indices (type Rj;;;) which count: 


n(n—1) 


No = 9 


(569) 
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2. Entries with only three distinct indices (type Rj;;,) which count: 
n(n —1)(n—2) 


ie ; (570) 
3. Entries with four distinct indices (type Rjj;4.) which count: 
—1 —2 —3 
Ny = R= Dln=2) (0-3) 


12 


The numerator in these three equations represents the k-permutations of n distinct objects 
which is given by: 

n! 
(n—k)! 
where k in these cases is equal to 2,3, 4 respectively, while the denominator represents the 
number of non-independent ways of generating these entries due to the anti-symmetric 
and block symmetric properties. 

By adding the numbers of the three types of non-zero distinct entries, as given by Eqs. 
569-571, we can see that the Riemann-Christoffel curvature tensor in an nD space has a 
total of: 


P(n,k) = (572) 


2 (m2 

ttt ol, 
Nea = No+ Ng +My =" 9) 
independent components which do not vanish identically. For example, in a 2D Rieman- 
nian space the Riemann-Christoffel curvature tensor has 24 = 16 components; however 
there is only one independent component (with the principal suffix 1212) which is not 
identically zero represented by the following four interdependent components: 


(573) 


R212 = Roi = —Ry221 = —Roi2 (574) 


Similarly, in a 3D Riemannian space the Riemann-Christoffel curvature tensor has 34 = 81 
components but only six of these are distinct non-zero entries which are the ones with the 
following principal suffixes: 


1919; 1313,.1013-°0193 3139: 9393 (575) 


where the permutations of the indices in each one of these suffixes are subject to the 
symmetric and anti-symmetric properties of the four indices of the Riemann-Christoffel 
curvature tensor, as in the case of a 2D space in the above example, and hence these 
permutations do not produce independent entries. Following the pattern in the examples 
of 2D and 3D spaces, in a 4D Riemannian space the Riemann-Christoffel curvature tensor 
has 44 = 256 components but only 20 of these are independent non-zero entries, while in a 
5D Riemannian space the Riemann-Christoffel curvature tensor has 54 = 625 components 
but only 50 are independent non-zero entries. 

On contracting the first covariant index with the contravariant index of the Riemann- 
Christoffel curvature tensor of the second kind we obtain: 


Rog = TY — OG, + THT, — TET (j =7 in Eq. 560) (576) 
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= 0,1%, — oT, + Tr, -TT% (relabeling dummy 7,7 in last term) 
= ny = OV, 
= Op [01 (In V/g)| — O71 [Dx (In V9) (Eq. 324) 
= 0,0, (In \/g) — 0,0; (In /9) 
= nO; (In \/g) — On (In \/9) (C? continuity condition) 
=0 
That is: . 
Rep =o (577) 


This is inline with the anti-symmetric property of the first two indices of the totally 
covariant Riemann-Christoffel curvature tensor. 
Finally, the Riemann-Christoffel curvature tensor also satisfies the following identity: 


Ragria + Pagike = Aapeng  agjat (578) 


As indicated earlier, a necessary and sufficient condition for a space to have a coordinate 
system such that all the components of the metric tensor are constants is that: 


i= 0 (579) 
This can be concluded from Eq. 558 plus Eqs. 307 and 308. 


7.2.2 Bianchi Identities 


The Riemann-Christoffel curvature tensor of the first and second kind satisfies a number 
of identities called the Bianchi identities. The first Bianchi identity is given by: 


Rizr Ritjk Riki; = 0 (first kind) (580) 
Rint Ryn Rij = 0) (second kind) (581) 


These two forms of the first Bianchi identity can be obtained from each other by the index 
raising and lowering operators. The first Bianchi identity, as stated above, is an instance 
of the fact that by fixing the position of one of the four indices and permuting the other 
three indices cyclically, the algebraic sum of these three permuting forms is zero, that is: 


Ruse} Raia He = 0 (i fixed) (582) 
Rijnt + Rijsx + Regu = 0 (7 fixed) (583) 
Feri Ruge = Dyies = 0 (k fixed) (584) 
hinit hea he —0 (I fixed) (585) 


All the above identities can be easily proved by using the definition of the Riemann- 
Christoffel curvature tensor noting that the Christoffel symbols of both kinds are symmet- 
ric in their paired indices as given by Eqs. 310 and 311. For example, Eq. 580 can be 
verified by substituting from the first line of Eq. 558 into Eq. 580, that is: 


Rijn + Rage + Rig = Ov [9l,4] — O:[9k, t] + [il 7] Uy — (th, 7] + (586) 
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Leptin T Reinke an Fak = 0 (first kind) (587) 
R' +R p=U (second kind) (588) 


4 4 L 
jklzm jlm;k 

Again, these two forms can be obtained from each other by the raising and lowering oper- 
ators. We note that the pattern of the second Bianchi identity in its both kinds is simple, 
that is the first two indices are fixed while the last three indices are cyclically permuted 
in the three terms. It is noteworthy that the Bianchi identities are valid regardless of the 


space metric. 


7.2.3 Ricci Curvature Tensor and Scalar 


The Ricci curvature tensor, which is a rank-2 absolute symmetric tensor, is a byproduct of 
the Riemann-Christoffel curvature tensor and hence it plays a similar role in characterizing 
the space and describing its curvature. There are two kinds of Ricci curvature tensor: first 
and second, where the first kind is a type (0, 2) tensor while the second kind is a type (1, 1) 
tensor. The first kind of this tensor is obtained by contracting the contravariant index 
with the last covariant index of the Riemann-Christoffel curvature tensor of the second 
kind, that is: 

SO eee La oe OV ia — al; + TeV ia ~ Vials; (589) 
where Eq. 560 is used in this formulation. The Ricci curvature tensor, as given by the 
last equation, can be written in the following mnemonic determinantal form: 


= 0; Oa Be i 
Ri; -| re re, 7 re. re (590) 


Because of Eq. 324 (i.e. Ey =O; (In JV9)); the Ricci tensor can also be written in the 
following forms as well as several other forms: 


Ri = 0;0;(In J/g) — OG, +1303, — 13,0 (In /9) (591) 


1 
= 0; (In ¥9) + PhP, — Fed (VITS) 
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where g is the determinant of the covariant metric tensor. As stated above, the Ricci 
tensor of the first kind is symmetric, that is: 


Rij = Ry (592) 


This can be easily verified by exchanging the 7 and 7 indices in the last line of Eq. 591 
taking account of the C? continuity condition and the fact that the Christoffel symbols 
are symmetric in their paired indices (Eq. 311). 
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On raising the first index of the Ricci tensor of the first kind, the Ricci tensor of the 
second kind is obtained, that is: 
as Res (593) 
This process can be reversed and hence the Ricci tensor of the first kind can be obtained 
from the Ricci tensor of the second kind using the index lowering operator. In an nD 
space, the Ricci tensor has n? entries. However, because of its symmetry the number of 
its distinct entries is reduced to: 
n(n+1) 
2 
The Ricci scalar R, which is also called the curvature scalar and the curvature invariant, 
is the result of contracting the indices of the Ricci tensor of the second kind, that is: 


C= (595) 


Nap = (594) 


Since the Ricci scalar is obtained by raising a covariant index of the Ricci tensor of the 
first kind using the raising operator followed by contracting the two indices, it may be 
written by some in the following form: 
- a2 1 
R= 9" Ry; = 9" 0,0; (In V9) as iB =O, (/g0%,) (596) 
VI 

where the expression inside the square brackets is obtained from the last line of Eq. 591. 
Similar expressions can be obtained from the other lines of that equation. 


7.3  Tensors in Science 


In this section, we give a few examples of tensor calculus applications in science. These 
examples come mainly from the disciplines of continuum mechanics and fluid dynamics. 
For simplicity, clarity and widespread use we employ a Cartesian approach in the following 
formulations. 


7.3.1 Infinitesimal Strain Tensor 


This is a rank-2 tensor which describes the state of strain in a continuum medium and 
hence it is used in continuum and fluid mechanics. The infinitesimal strain tensor 7+ is 
defined by: 

_ Vd+(Vd)" 

awe 2 

where d is the displacement vector and the superscript 7’ represents matrix transposition. 
We note that some authors do not include the factor 5 in the definition of y. The dis- 
placement vector d represents the change in distance and direction which an infinitesimal 
element of the medium experiences as a consequence of the applied stress. In Cartesian 
coordinates, the last equation is given in tensor notation by the following form: 


O;d; + Oj; 
Vig = <5 


(597) 


(598) 
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7.3.2 Stress Tensor 


The stress tensor, which is also called Cauchy stress tensor, is a rank-2 tensor used for 
transforming a normal vector to a surface to a traction vector acting on that surface, that 
is: 

T=o-n (599) 
where T is the traction vector, o is the stress tensor and n is the normal vector. This is 
usually expressed in tensor notation using Cartesian coordinates by the following form: 


T; = O4jN; (600) 


We should remark that the stress tensor is symmetric in many applications (e.g. in the 
flow of Newtonian fluids) but not all, as it can be asymmetric in some cases. We also 
remark that we chose to define the stress tensor within the context of Cauchy stress law 
which is more relevant to the continuum mechanics, although it can be defined differently 
in other disciplines and in a more general form. 

The diagonal components of the stress tensor represent normal stresses while the off- 
diagonal components represent shear stresses. Assuming that the stress tensor is sym- 
metric, it possesses ue independent components, instead of n?, where n is the space 
dimension. Hence in a 3D space (which is the natural space for this tensor in the common 
physical applications) it has six independent components. In fluid dynamics, the stress 
tensor (which may also be labeled as the total stress tensor) is decomposed into two main 
parts: a viscous part and a pressure part. The viscous part may then be split into a 
normal stress and a shear stress while the pressure part may be split into a hydrostatic 


pressure and an extra pressure of hydrodynamic nature. 


7.3.3. Displacement Gradient Tensors 


These are rank-2 tensors denoted by E and A. They are defined in Cartesian coordinates 
using tensor notation as: 


Ox; Ox. 
a Z A a ig 1 
S Ox; a On oe 


where the indexed x and 2’ represent the Cartesian coordinates of an observed continuum 
particle at the present and past times respectively. These tensors may also be called 
the deformation gradient tensors. The first displacement gradient tensor E quantifies the 
displacement of a particle at the present time relative to its position at the past time, 
while the second displacement gradient tensor A quantifies its displacement at the past 
time relative to its position at the present time. From their definitions, it is obvious that 
E and A are inverses of each other and hence: 


EixAnj = 6i (602) 
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7.3.4 Finger Strain Tensor 


This rank-2 tensor, which may also be called the left Cauchy-Green deformation tensor, is 
used in the fields of fluid and continuum mechanics to describe the strain in a continuum 
object, e.g. fluid, in a series of time frames. It is defined as: 


B=E-E’ (603) 


where E is the first displacement gradient tensor as defined in § 7.3.3 and the superscript 
T represents matrix transposition. The last equation can be expressed in tensor notation, 
using Cartesian coordinates, as follows: 


OE OU. 


(604) 


where the indexed x and x’ represent the Cartesian coordinates of an element of the 
continuum at the present and past times respectively. 


7.3.5 Cauchy Strain Tensor 


This tensor, which may also be called the right Cauchy-Green deformation tensor, is 
the inverse of the Finger strain tensor (see § 7.3.4) and hence it is denoted by Bvt. 
Consequently, it is defined as: 

BOS Ariz (605) 


where A is the second displacement gradient tensor as defined in § 7.3.3. The last equation 
can be expressed in tensor notation, using Cartesian coordinates, as follows: 


/ / 
Ai OL, OF», 
9 Ox; Ox; 


(606) 


The Finger and Cauchy strain tensors may be called “finite strain tensors” as opposite 
to infinitesimal strain tensor (see § 7.3.1). They are symmetric positive definite tensors; 
moreover they become the unity tensor when the change in the state of the object from 
the past to the present times consists of rotation and translation with no deformation. 


7.3.6 Velocity Gradient Tensor 


This is a rank-2 tensor which is often used in fluid dynamics and rheology. As its name 
suggests, it is the gradient of the velocity vector v and hence it is given in Cartesian 
coordinates by: 

[Vv]; = A; (607) 


The velocity gradient tensor in other coordinate systems can be obtained from the expres- 
sion of the gradient of vectors in these systems, as given, for instance, in § 6.4 and § 6.5 
for cylindrical and spherical coordinates. The term “velocity gradient tensor” may also be 
used for the transpose of this tensor, i.e. (Vv)’. The velocity gradient tensor is usually 
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decomposed into a symmetric part which is the rate of strain tensor S (see § 7.3.7), and 
an anti-symmetric part which is the vorticity tensor S (see § 7.3.8), that is: 


Vv=S+S (608) 
As seen earlier (refer to § 3.1.5), any rank-2 tensor can be decomposed into a symmetric 
part and an anti-symmetric part. 


7.3.7 Rate of Strain Tensor 


This rank-2 tensor, which is also called the rate of deformation tensor, is the symmetric 
part of the velocity gradient tensor and hence it is given by: 


y 
S= wt Wy (609) 
which, in tensor notation with Cartesian coordinates, becomes (see Eq. 99): 
ivy + jv, 
Siz = a a (610) 


We note that some authors do not include the factor 5 in the definition of S and S and 
hence this factor is moved to the definition of Vv (Eq. 608). Also, the tensors S and S 
are commonly denoted by ¥ and w respectively. 
The rate of strain tensor is a quantitative measure of the local rate at which neighboring 
material elements of a senanias continuum move with respect to each other. As a rank-2 
nt 


symmetric tensor, it has so) independent components which are six in a 3D space. The 


rate of strain tensor is related to the infinitesimal strain tensor (refer to § 7.3.1) by: 
=e (611) 


where ¢ is time. Hence, the rate of strain tensor is normally denoted by ¥ where the dot 
represents the temporal rate of change, as indicated above. 


7.3.8 Vorticity Tensor 


This rank-2 tensor is the anti-symmetric part of the velocity gradient tensor and hence it 
is given by: 


T 
5 _ Vv Wy) (612) 
which, in tensor notation with Cartesian coordinates, becomes (see Eq. 100): 
= _ div; — dv, 
j,-— (613) 


The vorticity tensor quantifies the local rate of rotation of a deforming continuum medium. 
As a rank-2 anti-symmetric tensor, it has no) independent non-zero components which 
are three in a 3D space. These three components, added to the aforementioned six com- 
ponents of the rate of strain tensor (see § 7.3.7), give nine independent components which 
is the total number of components of their parent tensor Vv. 
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fon 


fee 


7.3 
7.4 


7.5 


7.6 


tot 


128 


ie) 


7.10 


Summarize the reasons for the popularity of tensor calculus techniques in mathemat- 
ical, scientific and engineering applications. 

State, in tensor language, the definition of the following mathematical concepts as- 
suming Cartesian coordinates of a 3D space: trace of matrix, determinant of matrix, 
inverse of matrix, multiplication of two compatible square matrices, dot product of 
two vectors, cross product of two vectors, scalar triple product of three vectors and 
vector triple product of three vectors. 

From the tensor definition of Ax (B x C), obtain the tensor definition of (A x B)xC. 
We have the following tensors in orthonormal Cartesian coordinates of a 3D space: 


A = (22, 31, 6.3) B = (3¢, 1.8, 4.9) C = (47, 5e, 3.5) 


nm 3 3. -€ 

Pa[be] Bele 7 
Use the tensor expressions for the relevant mathematical concepts with systematic 
substitution of the indices values to find the following: tr(D), det (E), D~', E- D, 
A-C,Cx B,C. (A x B) and B x (C x A). 
State the matrix and tensor definitions of the main three independent scalar invariants 
(J, II and JT) of rank-2 tensors. 
Express the main three independent scalar invariants (J, JJ and IJ) of rank-2 tensors 
in terms of the three subsidiary scalar invariants (11, [2 and J3). 
Referring to question 7.4, find the three scalar invariants (J, JJ and III) of D and 
the three scalar invariants (J,, J2 and J;) of E using the tensor definitions of these 
invariants with systematic index substitution. 
State the following vector identities in tensor notation: 


Ver = 0 
V-(fA) = fV-A+A-Vf 
A x (V x B) (VB)-A—A-VB 
Vx(AxB) = (B-V)A+(V-B)A-(V-A)B-(A-V)B 


State the divergence and Stokes theorems for a vector field in Cartesian coordinates 
using vector and tensor notations. Also, define all the symbols involved. 

Prove the following vector identities using tensor notation and techniques with full 
justification of each step: 


I 
os) 
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td 


7.12 


7.13 


7.14 


7.15 


7.16 


7.17 


7.18 


7.19 


7.20 


(eral 


1.22 


7.23 


7.24 
7.25 


7.26 


ce 
7.28 


7.29 
7.30 


What is the type, in the form of (m,n, w), of the Riemann-Christoffel curvature tensor 
of the first and second kinds? 

What are the other names used to label the Riemann-Christoffel curvature tensor of 
the first and second kinds? 

What is the importance of the Riemann-Christoffel curvature tensor with regard to 
characterizing the space as flat or curved? 

State the mathematical definition of the Riemann-Christoffel curvature tensor of either 
kinds in determinantal form. 

How can we obtain the Riemann-Christoffel curvature tensor of the first kind from 
the second kind and vice versa? 

Using the definition of the second order mixed covariant derivative of a vector field 
(see § 5.2) and the definition of the mixed Riemann-Christoffel curvature tensor, verify 
the following equation: Aj.) — Aj.1, = Ri Ai. Repeat the question with the equation: 
AP, yy Aly = RAL 

Based on the equations in question 7.16, what is the necessary and sufficient condition 
for the covariant differential operators to become commutative? 

State, mathematically, the anti-symmetric and block symmetric properties of the 
Riemann-Christoffel curvature tensor of the first kind in its four indices. 

Based on the two anti-symmetric properties of the covariant Riemann-Christoffel cur- 
vature tensor, list all the forms of the components of the tensor that are identically 
zero (e.g. Riijk)- 

Verify the block symmetric property and the two anti-symmetric properties of the 
covariant Riemann-Christoffel curvature tensor using its definition. 

Repeat question 7.20 for the anti-symmetric property of the mixed Riemann-Christoffel 
curvature tensor in its last two indices. 

Based on the block symmetric and anti-symmetric properties of the covariant Riemann- 
Christoffel curvature tensor, find (with full justification) the number of distinct non- 
vanishing entries of the three main types of this tensor (see Eqs. 569-571). Hence, 
find the total number of the independent non-zero components of this tensor. 

Use the formulae found in question 7.22 and other formulae given in the text to find 
the number of all components, the number of non-zero components, the number of 
zero components and the number of independent non-zero components of the covariant 
Riemann-Christoffel curvature tensor in 2D, 3D and 4D spaces. 

Prove the following identity with full justification of each step of your proof: R%,, = 0. 
Make a list of all the main properties of the Riemann-Christoffel curvature tensor (i.e. 
rank, type, symmetry, etc.). 

Prove the following identity using the Bianchi identities: Rijs; + Rijrs = Riksi + 
Rany st: 

Write the first Bianchi identity in its first and second kinds. 

Verify the following form of the first Bianchi identity using the mathematical definition 
of the Riemann-Christoffel curvature tensor: Ry. + Reijt + Ryka = 0. 

What is the pattern of the indices in the second Bianchi identity? 

Write the determinantal form of the Ricci curvature tensor of the first kind. 
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7.31 


7.32 
7.33 
7.34 


7.35 


7.36 


7.37 


7.38 
7.39 
7.40 
7.41 
7.42 
7.43 
7.44 


7.45 


7.46 


Starting from the determinantal form of the Ricci curvature tensor of the first kind, 
obtain the following form of the Ricci curvature tensor with justification of each step 
in your derivation: Rj; = 0,0; (In /g) +TZT?, — Fq0a (Jars): 

Verify the symmetry of the Ricci tensor of the first kind in its two indices. 

What is the number of distinct entries of the Ricci curvature tensor of the first kind? 
How can we obtain the Ricci curvature scalar from the covariant Riemann-Christoffel 
curvature tensor? Write an orderly list of all the required steps to do this conversion. 
Make a list of all the main properties of the Ricci curvature tensor (rank, type, sym- 
metry, etc.) and the Ricci curvature scalar. 

Outline the importance of the Ricci curvature tensor and the Ricci curvature scalar 
in characterizing the space. 

Write, in tensor notation, the mathematical expressions of the following tensors in 
Cartesian coordinates defining all the symbols involved: infinitesimal strain tensor, 
stress tensor, first and second displacement gradient tensors, Finger strain tensor, 
Cauchy strain tensor, velocity gradient tensor, rate of strain tensor and vorticity 
tensor. 

Which of the tensors in question 7.37 are symmetric, anti-symmetric or neither? 
Which of the tensors in question 7.37 are inverses of each other? 

Which of the tensors in question 7.37 are derived from other tensors in that list? 
What is the relation between the first and second displacement gradient tensors? 
What is the relation between the velocity gradient tensor and the rate of strain tensor? 
What is the relation between the velocity gradient tensor and the vorticity tensor? 
What is the relation between the rate of strain tensor and the infinitesimal strain 
tensor? 

What are the other names given to the following tensors: stress tensor, deformation 
gradient tensors, left Cauchy-Green deformation tensor, Cauchy strain tensor and rate 
of deformation tensor? 

What is the physical significance of the following tensors: infinitesimal strain tensor, 
stress tensor, first and second displacement gradient tensors, Finger strain tensor, 
velocity gradient tensor, rate of strain tensor and vorticity tensor? 
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168, 170, 172 
Kronecker delta, 132 
metric tensor, 7, 39, 43, 52, 70, 92, 93, 96, 109, 
110, 114, 124, 125, 132, 142 
partial derivative, 11 
permutation tensor, 8, 84, 100, 107, 132 
tensor, 32, 50-55, 65, 99, 121, 143, 153 


tensor derivative, 11 
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Coordinate 
curve, 24-27, 35-40, 43-48, 53, 55, 103-106, 
123 
surface, 24-26, 35-40, 43-46, 48, 53, 55, 105, 
110, 111 


system, 7, 10, 12, 138, 15, 16, 20, 22-36, 39- 
41, 44-51, 53, 55, 57, 58, 62, 70-72, 74-77, 
83, 91-95, 97, 99, 102, 103, 105, 108-113, 
118-121, 123-125, 128, 129, 132-138, 140, 
144-148, 150, 152, 158, 161, 167, 171, 175 
Covariant 
basis vector, 6, 24-27, 39-41, 43-45, 48, 49, 
53-55, 69-71, 74, 91, 92, 99, 109, 113, 114, 
132 
component, 43, 48, 49, 54, 69, 71, 97, 100, 101, 
121, 141, 143 
derivative, 6, 11, 120-131, 133-136, 141-143, 
145, 168, 178 
differentiation, 10, 112, 120-132, 134-136 
index, 51-53, 58, 93, 122, 137, 152, 170, 172, 
173 
Kronecker delta, 132 
metric tensor, 7, 39, 42, 43, 52, 53, 70, 81, 92— 
94, 96, 99, 103, 105, 108-110, 114, 124, 
125, 132, 142, 172 
permutation tensor, 8, 99, 132 
Riemann-Christoffel curvature tensor, 167, 169, 
171, 178 
tensor, 32, 50-55, 64, 65, 67, 99, 110, 121, 145 
Cross product, 43, 56, 57, 84, 99, 100, 103, 104, 110, 
137, 139, 143, 152, 157, 163, 164, 177 
Curl, 6, 56, 72, 139, 143, 144, 147, 149, 151-154 
Curvature 
invariant, 173 
scalar, 173 
Curve, 6, 7, 11, 12, 23-25, 35, 36, 53, 76, 98, 99, 
103, 110, 129-132, 136 
Curved space, 22-24, 46, 94, 109, 110, 178 
Curvilinear coordinate system, 24-26, 30, 34, 36, 
40, 47, 48, 71, 120, 121, 123, 128, 133, 134, 
137 
Cylindrical coordinate system, 6, 8, 24-26, 28, 35, 
37, 39, 47, 48, 71, 95, 110, 111, 1138, 117- 
119, 134, 137, 148, 150, 152, 153, 175 


Determinant, 6, 7, 11, 29, 30, 42, 43, 53, 72, 81, 85— 
89, 91, 92, 94, 99, 102, 105, 108-110, 115, 
126, 142, 156, 158, 172, 177 

Diagonal matrix, 44, 95 

Differentiable, 31, 121-124, 127, 129-131, 134-136, 
138-144, 147, 149-152, 158, 160, 161 

Differential 


calculus, 11 
geometry, 12, 17, 24, 119, 155 
operator, 6, 10, 11, 51, 52, 121, 124, 126, 135, 
137, 138, 140, 146, 152, 168, 178 
Dimension of space, 6, 12, 15-18, 20, 24, 29, 30, 39, 
44, 46, 58, 64, 65, 72, 76, 77, 82, 83, 107, 
119, 156, 158, 174 
Direct 
multiplication, 64-66 
notation, 12 
product, 56-58, 64, 65, 73 
transformation, 29 
Displacement 
gradient tensor, 6, 7, 174, 175, 179 
vector, 6, 50, 102-106, 173 
Distributive, 18, 64, 65, 67, 73 
Distributivity, 163-165, 167 
Divergence, 6, 137, 189, 142-145, 147, 149, 151-154 
theorem, 160, 177 
Dot product, 27, 42, 43, 53, 66-68, 74, 92, 93, 97, 
98, 110, 114, 129, 130, 139, 156, 177 
Double inner product, 68, 83, 84 
Dummy index, 9, 14-18, 20, 52, 169 
Dyad, 6, 55, 56, 65, 68, 72, 83, 122, 148-151 
Dyadic 
multiplication, 64 
product, 54 


Ellipsoid, 23, 94 

epsilon-delta identity, 87, 90, 108 
Euclidean, 22—24, 27, 46, 94, 134, 167, 169 
Exterior multiplication, 64 

Extra pressure, 174 


Finger strain tensor, 6, 175, 179 
Flat 
metric, 94 
space, 23, 27, 31, 46, 47, 94, 96, 109, 110, 127, 
167, 168, 178 
Fluid 
dynamics, 138, 173-175 
mechanics, 69, 155, 173, 175 
Form-invariance, 77 
Form-invariant, 12, 13 
Free index, 9, 14-18, 20, 51, 52, 59, 63, 66, 123, 
161-167 


Gauss theorem, 155 
General 
coordinate system, 1, 7, 10, 11, 20, 22, 24, 27, 
39, 41-43, 53, 57, 66, 70, 72—74, 91-93, 97, 
99, 100, 102, 103, 105, 106, 108-112, 119, 
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120, 122, 133, 136, 187, 140, 141, 144-146, 
148, 150, 152, 153, 161, 167 
tensor, 62, 112, 120, 126, 133, 136, 143 
Generalized Kronecker delta, 7, 76, 82, 88-90, 108, 
126 
Gibbs notation, 12 
Gradient, 6, 32, 39, 48, 45, 50, 129, 186-142, 144, 
145, 147, 149, 150, 152-154, 175 
vector, 32, 43 


Handedness, 32, 55, 81, 105 
Homogeneous coordinate system, 23, 27, 29, 47, 110 
Hydrostatic pressure, 174 


Identity, 32 
tensor, 22, 43, 76, 83, 91, 94 
transformation, 34, 48 
Imaginary, 9, 11, 27, 29, 92, 96 
Improper 
rotation, 58 
transformation, 32-34, 48, 55, 77, 78, 81, 107 
Index 
-free notation, 12, 13 
notation, 9 
Indicial 
notation, 12, 13, 17, 19, 126, 135 
structure, 17, 18, 20, 51, 52, 58, 63, 64 
Infinitesimal, 6, 22, 23, 34, 46, 91, 103-106, 108, 
111, 173 
strain tensor, 7, 173, 175, 176, 179 
Inner 
multiplication, 73, 74, 94, 113 
product, 6, 57, 58, 64-69, 74, 83, 84, 94, 97, 98, 
113, 124, 126, 127, 129, 131, 156 
Integral, 105, 110 
theorem, 155, 160, 161 
Intrinsic derivative, 11, 130, 136 
Invariance, 12, 16, 19, 71-73, 91, 112, 120 
Invariant, 12, 18, 31, 32, 41, 47, 55, 58, 61, 62, 73, 
76, 77, 98, 103, 121, 139, 141, 153, 155, 
157, 158, 177 
Inverse, 32 
Jacobian, 7, 30, 31 
of matrix, 44, 95, 156, 177 
of metric, 44, 91, 92, 95, 109 
of tensor, 174, 175, 179 
transformation, 29-31, 47 
Invertible 
matrix, 43, 91, 95, 158 
transformation, 30, 31 
Irrotational, 160 
Isotropic, 23 


tensor, 58, 72, 76-78, 81 


Jacobian, 7, 29-33, 39, 42, 47, 48, 57, 94, 105 
matrix, 7, 29-31, 33, 42, 47 


Kronecker delta, 7, 15, 41, 53, 58, 71, 76, 77, 82, 
85-90, 93, 96, 107-109, 126, 127, 132, 135, 
136 


Laplacian, 6, 10, 137, 188, 140, 144-146, 148-154, 
164 
Length, 6, 7, 15, 22, 23, 25, 27, 29, 39, 46, 53, 76, 
83, 91, 92, 94, 96, 102, 103, 108, 110, 157 
Levi-Civita 
identity, 87 
tensor, 58, 77 
Linear 
algebra, 42, 44, 67, 74, 95, 155, 156 
combination, 113 
coordinate system, 25 
operation, 124 
transformation, 26, 27, 29, 34, 62, 112, 121 
Lorentz transformations, 23, 29, 95, 110 
Lowering operator, 52, 71, 93, 94, 97, 112-114, 126, 
168, 171-173 


Manifold, 22-24, 31, 39, 42, 46, 94 
Matrix 
algebra, 65, 66, 74, 155, 156 
notation, 156 
Metric 
space, 22 
tensor, 7, 15, 22, 23, 26, 27, 32, 39, 40, 42- 
49, 52, 53, 69-71, 76, 81, 90-99, 103, 105, 
107-110, 112-114, 118, 119, 123-126, 129, 
131-136, 142, 147, 148, 150, 167-169, 171, 
172 
Minkowski 
metric, 95, 110 
space, 23, 29 
Mixed 
derivative, 10, 11, 127, 128, 135, 168, 169, 178 
Kronecker delta, 7, 77, 82, 93, 132 
metric tensor, 7, 22, 41, 43, 91-93, 109, 124, 
125, 132 
Riemann-Christoffel curvature tensor, 168, 169, 
178 
tensor, 7, 14, 18-22, 32, 41, 43, 51, 52, 54, 63, 
65, 66, 69, 71, 77, 81, 82, 91-94, 102, 109, 
121, 122, 124, 125, 131, 132, 141, 143, 152, 
168, 169, 178 
Multiplication 
by scalar, 62, 64, 73 
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of matrices, 67, 73, 92, 156, 177 

of matrix by vector, 67, 156 

of tensors, 14, 58, 62, 64-67, 73, 74, 89, 94, 118, 
124, 131 

Mutually 

exclusive, 50 

independent, 24, 40, 44 

orthogonal, 15, 25, 26, 36, 39, 43, 44, 49, 53, 
83, 92 

perpendicular, 24-27, 55, 94 


nabla operator, 6, 137-140, 148, 146, 148, 150, 152, 
153 
Negative orthogonal transformation, 34, 48 
Non- 
Euclidean, 167 
scalar tensor, 24, 32, 64, 65, 112, 120, 121, 128, 
133, 134, 136, 146 
singular, 91, 92 
Nonlinear transformation, 26, 34 
Normal 
stress, 174 
vector, 7, 174 
Normalized vector, 6, 102 


Oblique coordinate system, 25, 26 


Order 
of derivative, 10, 11, 127, 128, 135, 168, 169, 
178 
of indices, 10, 16-21, 51, 52, 55, 64, 66, 83, 93, 


94, 109, 119, 123, 134, 141, 162 
of multiplicands, 67, 73, 124, 131, 157, 162-164 
of operators, 11, 18, 138, 162 
of tensor, 14, 16, 18, 20 
f transformations, 32 
Orthogonal, 15, 25-27, 36, 39, 43, 44, 49, 53, 83, 92 
coordinate system, 7, 24-26, 36, 39, 41, 48, 47, 
70, 71, 74, 94, 95, 103-105, 108-111, 115- 
117, 119, 133, 184, 187, 146-148, 150, 153 
transformation, 27, 33, 34, 48, 55, 78, 81 
Orthonormal 
basis set, 15, 43, 49, 53, 71, 83, 148, 150 
Cartesian, 15, 16, 22, 23, 27, 28, 30, 39, 48, 51, 
55, 66, 68, 71, 74, 83, 91, 93-95, 98, 99, 
108-111, 118, 119, 134, 177 
coordinate system, 15, 16, 20, 22, 23, 27, 28, 
30, 39, 48, 51, 55, 66, 68, 71, 74, 83, 91, 
93-95, 98, 99, 108-111, 118, 119, 134, 177 
vectors, 6, 15, 20, 49, 72, 83, 84, 99, 108 
Orthonormalized vectors, 6, 53, 72 
Outer 
multiplication, 64, 67, 73 
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product, 6, 64-68, 94, 124, 126, 131, 138 


Parallelepiped, 41, 105 
Partial 
derivative, 6, 10, 11, 30, 31, 53, 96, 112-114, 
117, 118, 120-124, 126-130, 132, 133, 135, 
168, 169 
differential operator, 11, 51, 52, 121, 138 
differentiation, 10, 83, 96, 120, 121, 124, 134, 
135 
Passive transformation, 33 
Perimeter, 161 
Permissible transformation, 31, 62, 94, 98 
Permutation 
of tensor, 68, 69, 74 
tensor, 8, 57, 76-78, 80-90, 99, 100, 102, 107, 
110, 132, 139 
Perpendicular, 6, 24-27, 39, 53, 55, 94 
Physical 
basis vector, 71, 74 
component, 69-71, 75, 137, 147, 148, 150, 153 
dimension, 25, 46, 69 
representation, 6, 39, 69, 71, 74 
Plane, 23, 25, 26, 35, 36, 46, 55, 94 
Polar 
angle, 36 
coordinate system, 8, 36, 148 
tensor, 55 
Polyad, 55 
Position vector, 6, 7, 39, 53, 102, 146, 158, 161 
Positive orthogonal transformation, 33, 34, 48 
Principle of invariance, 12, 19, 112 
Product rule, 96, 114, 121-125, 130, 132, 141, 142, 
145, 162, 163, 165, 166 
Proper 
rotation, 58 
tensor, 55 
transformation, 32-34, 48, 77, 78, 81, 107 
Pseudo 
tensor, 55-58, 63, 72, 78, 81 
vector, 55, 57, 72, 157 


Quotient rule of 
differentiation, 69 
tensor, 62, 69, 74 


Radius, 23 
Raising operator, 52, 71, 93, 94, 97, 112, 121, 126, 
141, 142, 153, 168, 171-173 
Range of index, 12-16, 18, 65, 79, 80, 160 
Rank 
-0 tensor, 9, 12, 18, 16, 19, 58, 62, 65, 67, 153, 
157 
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-1 tensor, 9, 12-14, 19, 50, 56, 58, 64, 67, 70, 
78, 140, 152, 157 
-2 tensor, 9, 10, 12-14, 16, 19, 22, 54, 56, 58— 
60, 64-68, 70-74, 76, 78, 79, 85, 86, 91, 
97, 102, 107, 110, 121, 122, 134, 136-139, 
141, 143, 149, 151-153, 156-158, 160, 161, 
172-177 
-3 tensor, 12-14, 19, 54, 60, 66, 72, 77-84, 86— 
88, 99, 107, 112 
-4 tensor, 65, 66, 72, 79, 107, 167 
of tensor, 9, 12, 14, 16, 19, 20, 42, 50-54, 58, 
59, 61-70, 72, 73, 77-79, 84, 88, 93, 97, 
107, 120, 123, 128, 130-132, 135, 137-148, 
146, 152, 157, 160, 161 
Rate of 
deformation tensor, 176, 179 
strain tensor, 7, 176, 179 
Real, 9, 11, 24, 27, 29, 35, 44, 103 
Reciprocal, 30, 31, 39, 41, 43, 44, 53, 63, 91, 92, 95 
metric tensor, 91 
Reciprocity relation, 53, 54, 72, 91 
Rectangular 
Cartesian, 25, 26, 29, 33, 34, 54, 56, 72, 92 
coordinate system, 25, 26, 29, 33, 34, 54, 56, 
62, 72, 92, 112, 129, 132, 146 
parallelepiped, 105 
Rectilinear coordinate system, 24-26, 30, 34, 35, 47, 
121, 123-125, 129, 132, 136 
Reference frame, 32 
Reflection, 33, 34, 55 
Relative 
permutation tensor, 8, 81, 99, 107, 110 
scalar, 94, 128, 136 
tensor, 7, 57, 58, 63, 66, 72, 78, 81, 89, 107, 
128, 136 
Replacement operator, 52, 82, 88, 93, 107, 126, 136 
Rheology, 175 
Ricci 
curvature scalar, 7, 172, 173, 179 
curvature tensor, 7, 172, 173, 178, 179 
theorem, 114, 115, 124-126, 135 
Riemann-Christoffel curvature tensor, 7, 22, 23, 127, 
128, 167-172, 178, 179 
Riemannian 
curvature, 23 
geometry, 22, 24, 46 
metric, 22, 94 
space, 22, 23, 46, 93, 170 
Right handed system, 41, 42, 49, 84 
Rotation, 32-34, 58, 175, 176 


Scalar, 6, 9-13, 16-20, 31, 34, 58, 59, 62-66, 68-70, 
72, 73, 94, 103, 123, 124, 128-132, 135- 
145, 147-151, 153, 157 
field, 19, 50, 129, 136, 152-154, 158 
invariant, 155, 157, 158, 177 
multiplication, 34 
operator, 137, 138, 152 
triple product, 41, 84, 100, 101, 105, 106, 110, 
111, 157, 177 
Scale factor, 7, 36, 39, 41, 48, 69-71, 95, 100, 103, 
105, 110, 111, 117-119, 146, 148, 150, 153 
Schur theorem, 23, 46 
Semi- 
circle, 26, 36 
plane, 26, 35 
Semicolon notation, 6, 10, 11, 19, 124 
Shear stress, 174 
Shifting operator, 40, 52, 93, 98, 109, 113, 114, 126, 
135, 136 
Skew-symmetric, 59, 73, 169 
Solenoidal, 160 
Sphere, 23, 25, 26, 35, 36, 46 
Spherical coordinate system, 6, 7, 10, 24-26, 28, 35, 
38, 39, 47, 48, 71, 75, 95, 110, 111, 113, 
117-119, 134, 187, 150, 152-154, 175 
Stokes theorem, 155, 160, 161, 177 
Straight line, 23, 25, 26, 35, 36, 47, 55 
Stress tensor, 8, 174, 179 
Subtraction of tensors, 62, 63, 73 
Sum rule, 132 
Summation, 11, 14-16, 20, 43, 62, 66, 113, 147 
convention, 11, 14, 15, 19, 83, 87, 137, 148, 150 
Surface, 6, 7, 12, 23-25, 35, 46, 53, 76, 94, 103-105, 
119, 160, 161, 174 
Symbolic notation, 9, 12, 19, 65, 67, 124, 126, 131, 
135, 139, 156 
Symmetric, 18, 22, 50, 59-62, 72, 73, 76, 77, 84, 
91-98, 109, 113, 116, 118, 133, 169-172, 
174-176, 178, 179 
tensor, 59-62, 73, 84, 172, 176 
Symmetry, 59, 61, 62, 73, 84, 107, 108, 119, 144, 
153, 169, 173, 178, 179 


Tangent vector, 32, 39, 43, 55, 98, 103, 129, 131 
Tensor 
algebra, 42, 62, 63, 156 
calculus, 1, 11, 12, 14, 17, 19, 22, 24, 57, 62, 65, 
67, 69, 75-77, 86, 91, 112, 121, 137, 152, 
155, 157, 161, 167, 168, 173, 177 
component, 18, 20, 55, 78, 86, 112, 123, 132, 
143, 152 
equality, 14-18, 20, 58, 62, 63, 72, 73, 77 
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expression, 14-18, 20, 56, 58, 63, 72, 87 notation, 12, 158, 160, 177 


field, 19, 21, 136, 152, 160, 161 operator, 137, 138, 140, 152 
identity, 76, 87, 89, 155, 161 triple product, 101, 102, 110, 157, 177 
multiplication, 64, 65, 73, 124, 131 Velocity 
notation, 12, 13, 17, 71, 84, 92, 99, 109, 110, gradient tensor, 6, 175, 176, 179 

137, 152, 153, 155, 157, 158, 160, 161, 173— vector, 7, 175 

177, 179 Volume, 6, 7, 39, 41, 105, 106, 111, 160 
representation, 50, 69-71, 74 Vorticity tensor, 7, 8, 176, 179 


term, 14-18, 20, 56-58, 60, 62, 63, 72, 82, 84, 
86, 120, 122-124, 126, 128, 132, 134-136, Weight of tensor, 7, 53, 57, 58, 63, 64, 66, 72, 73, 


144, 162, 164, 172 78, 81, 89, 94, 107, 128, 136 
test, 69, 74 

Torus, 23 Zero 

Total tensor, 14, 58, 62, 72, 73, 133 
derivative, 112, 130, 132, 133, 136 vector, 58 


differentiation, 112, 130, 132, 136 
stress tensor, 174 
Totally 
anti-symmetric, 61, 72, 78, 81, 84, 107 
covariant Riemann-Christoffel curvature tensor, 
167 
symmetric, 61, 72 
Trace, 7, 66, 83, 139, 156, 158, 177 
Traction vector, 7, 174 
Transformation, 7, 11-13, 15, 19, 22, 26, 27, 29-34, 
36, 40-42, 47-51, 55, 57, 58, 61, 62, 69, 
71-74, 76-78, 81, 91, 93, 94, 98, 103, 105, 
107, 109, 112, 121, 141, 153, 157 
Translation, 33, 34, 175 
Triad, 54, 65 
True 
scalar, 31, 157 
tensor, 55-58, 63, 72 
vector, 31, 55, 57, 72 


Unit 

tensor, 14 

vector, 55, 56, 84, 160 
Unity 

matrix, 53 

tensor, 14, 92, 94, 109, 175 


Vector, 6, 7, 9, 11-13, 15-17, 19, 20, 25-27, 31, 32, 

39-44, 48, 53-59, 64-74, 83, 84, 87, 91-94, 
97-103, 105, 108, 110, 111, 1138, 121-124, 
127-131, 134, 137-144, 146, 147, 149-153, 
156-158, 160, 163, 164, 168, 174, 175, 177 

algebra, 155 

calculus, 74, 137, 140, 147, 148, 152, 155, 157, 
158 

field, 19, 138, 152-154, 158, 160, 177, 178 

identity, 87, 161, 177 
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